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_ DNA sequences of the EBV genome, recombinant DNA 
molecules, processes for producing EBV-reiated antigens, di- 
agnostic compositions and pharmaceutical compositions con- 
taining said antigens. 

Described are DNA sequences of the EBV genome cod- 
" ing for EBV-related antigens, recombinant DNA molecules 
containing said DNA sequences, vector/host systems for clon- 
~jding and expression of said DNA sequences, EBV-reiated anti- 
gens and methods for their preparation diagnostic and phar- 
^■'maceutical compositions containing said DNA sequences and 
antigens respectively (Fig. 2). 
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DNA sequences of the E3V genome, recombinant DNA molecules, 
processes for producing EBV-related antigens, dia- 
gnostic compositions and pharmaceutical 'compositions 
containing* said'-anti'gens * 

Technical field of invention 



This invention relates to DNA sequences of the EBV. genome 
coding at least for parts of EBV-related antigens- to be used in 
25 methods and diagnostic and pharmaceutical conpositicns referred to 
below and methods of localising and' isolating at least part of the 

DNA 



: Furthermore "the/iavention re*lates'--to* -recombinant DNA - 
° molecules i.e. -cloning and expression vectors*, useful for 
the prdducti6ir™of :r antigenic determinants: of . said EBV- 
-related 'antigens^af ter"in"troductiorr of these vectors 
.-- into .appropriate-ghosts : s - uch. as. jDacteria, yeasts and__ 
'mammalian cells - 
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Finally this invention relates to methods and compositions 
or kits,* respectively , for a rapid, simple, highly sensitive 
and highly specific determination of antibodies directed 
to EBV-related antigens. In these tests different antigens 
of EBV are used to detect specific antibody classes in 
the patient's serum, directed to these antigens. This detec- 
tion allows fairly reliable conclusions as to the status 
of infection of the serum donor such as preinfection, fresh 
infection, chronic infection, convalescence and neoplastic 
condition . Furthermore, this invention relates to pharma- 
ceutical compositions, e.g. vaccines containing said 
antigens useful for prophylaxis and therapy, of EBV-related 
diseases . 

Background Art 

The herpesviruses (Herpetoviridiae) are. enveloped ico- 
sahedral capsids with an overall diameter of 1 5o nm. The 
viral genome consists of ■ a double-stranded DNA with 

Q 

a molecular weight of approximately 1o D. Human herpes- 
viruses are Herpes simplex I ("fever blisters") , Herpes _ 
simplex II (genital herpes) , Varicella-Zoster, (chickenpox, 
shingles) Cytomegalovirus (congenital abnormalities, e.g. 
microcephaly) , and Epstein-Barr virus (EBV) (infectious 
mononucleosis (IM), Bur kitt • s lymphoma -(BL) nasopharyngeal 
carcinoma (NPC) . _ 

Herpesviruses display a- remarkable propensity. _£or establish- 
ing latent infections which may persist for the life of 
the host.. After, the primary infection the virus may remain 
quiescent,, being- demonstrable only sporadically or not 

at all, until it is reactivated by -one-of. ^several known 
types of stimulus, such as irradiation or immunosuppression. 
Such exacerbations of endogenous disease may take the form 
of a crop of vesicles on the skin in the case of herpes 
simplex or zoster, or more generalized effects in the 
case of cytomegalovirus or EBV. The capacity to persist 
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L indefinitely as a Latent infection enables these viruses 
to survive in nature for a long time.- During the last 
years, attention has turned to the correlation of human 
cancer and E3V . 

Eos tain-3arr- Virus (EBV) , infections and their conse- 
quences ' " " " 



E3V causes infectious mononucleosis as a primary disease. 
10 Predominantly it affects children or young adults. More 
than 9o % of the average adult population is infected by 

* 

EBV that persists for lifetime in peripheral B-lyraphocytes 
- The- virus is lifelong produced, in the parotid glarxi and spread 
via the oral route. 

15 

Serology 'suggests that ~EBV might be involved, in causing 
two neoplastic diseases of man, African Burkitt's lymphoma 
(BL) and nasopharyngeal carcinoma. (NPC) . Infectious mono- 
* nucleosis is a consequence of primary infection by EBV. 
20 It- is- not a life-threatening disease if additional risk 
factors are absent. 



25 



However, the subjective feeling of sickness, frequently 
for extended periods (in the order of several weeks) , and 
the necessity to avoid physical stress due to the drastical- 
ly increased risk of -spleenic rupture would certainly 
suggest a. control, of ... this disease. 

■ ---The-clinical" diagnosis of infectious mononucleosis' is 
3 0 

.•'usually -derived from a combination of '.the following para- 
- - . ••• meters t . -. - 

1-;- High., leukocyte count -ranging from 10,000 to 2o,ooo and 
reaching up to So^ooo 

2. To % atypical cells 

3 . lymphadenitis 

4. fever. 
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1 Patients with infectious mononucleosis 

shed E3V in their saliva. Virus shedding does not require 
special prevention against spreading the disease as epi- 
demics and infection of persons in close contact are rare 

5 (A.S. Evans, "The transmission of SB viral infections. 
Viral Infections in Oral Medicine.", edited fay J. Hooks, 
G. Jordan, Elsevier North Holland Amsterdam, p. 211 
(1982)). Virus shedding does not scop with recovery from 
disease and at least 60 % (possibly up to loo %) of the 

) adult population shed at least low levels of EBV which 
is produced lifelong in epithelial cells of the salivary 
duct of the parotid gland (H. Wolf, M. Haus , E. Wilmes , 
"Persistence of Epstein-3arr virus in the parotid gland", 
J. Virol. 51 (1984) ) . 

About. 1 % of the infectious mononucleosis cases show 
complications either already at the onset of the disease 
or as a late consequence. Most complications are due to 
autoimmune mechanisms and are in . some cases indiscernable 
from graft versus host disease, a mechanism by which 
the body might clear itself from the excess of EBV con- 
verted proliferating 3-cells. 

If the T-call response is insufficient, e.g. due to 
circumstances like treatment with high doses of Cyclo- 
sporin A in combination with corticosteroids or due to 
AIDS or a certain genetic predisposition. as described by 
?urtilo (Duncan's syndrome, X -chromosome-linked iyrapho- 
proliferative. .disease- (XL?) ; O.T. Purtiio-, _K. .SaJcamoto , 
V.. 3arnabei,j. Seeley,:.-T: Bechtolg, _G. Rogers, J. Yazz , 
S. Harada and the XLP-coliaborators : "E?stein-3arr virus- 
induced diseases, in boys .with.. the X-linked lympho-oro- 
liferative syndrome (XL?) . Update on studies_of . the 
registry." Am. J. Med. 73, p.. 49 ( 1 982 )), infected. 
3-cslls may have a chance to escape, from host control and 
grow without limitation as they would do when being 



G-1 7 3 2* 



5 

1 cultivated in vitro. The consequences have been described 
as BL-like disease in cases of AIDS patients (J.L. Ziegler, 
R.C Miner, E. Ros'enbaum, E.T. Lennette,' E. Shillitoe, 
C. Casavant, W.L. Drew, L. Mintz, J. Gershor, J. Green- 
5 span, J . Becicstead, K. Yamamo'to, "Outbreak of Burkitt's- 
like lymphoma in homosexual men.", Lancet 2, p. 631 (1982)} 
or as a polyclonal lympho-prolif erative disease for XL?" 
patients (D.T. Purtiio at al., supra) or kidney transplant 
recipients (D.W. Hanto, G . Frizzera, D.T. Purtilo, K. 
10 Sakamoto, J.L. Sullivan, A.K. Saemundsen, G. Klein, R.L. 
Simmons, J.S. Najarian, "Clinical spectrum of lympho-pro- . 
liferative disorders in renal transplant recipients and 
evidence for the role of Epstein-3arr virus . " , Cancer 
Res. 41 , p. 4253 (1981 ) ) . 

15 

The positive and fast identification of infectious mono- 
nucleosis or acute EBV infection is especially important 
in cases where a differential diagnosis'. to leukemia or; 
in case of transplant recipient, to graft rejection crisis 
20 is- necessary . In these cases, a false diagnosis may lead 
to incorrect therapy, which may have serious, even life- 
threatening effects. 

25 Prevention of primary disease caused by EBV 

Infectious mononucleosis seems to be unknown in areas like 
the .Philippines or Malaysia (D.S.K. Tan, "Absence o~f 
infectious mononucleosis among Asians in Malaya.", 

30 Med. "J. Malaya 21, p. 358 (1967)) where infection." by Z3V~ . 
occurs very early in life. Almost the whole'. population 

has antibodies at the ace of.' 2-1o years* at.""the latest. - 

' Clinical* symptoms seem to be a consequence "of "juvenile, 
or adult* Infection." It" can be assumed that" a "vaccine— " : 

ob primed organism will be infected without significant' 

clinical symptoms and that the consequences often fatal 
in the risk groups listed above could be eliminated by 
a vaccine. 




0173254 



6. 

1 Burkitt's Lymphoma and EBV 



The development of Surxitt's Lymphoma is l inke d to 
chromosomal rearrangements. Mot all cases contain E3V 
5 genomes in the tumor ceils. However, at least in ar-as 
with high incidence, 97 % of these neoplasias are 
E3V-related and a control of E3V infection is like'v 
to reduce the risk of developing Burkitfs lymphoma" 

10 Sasooharvnaeal carcinom a as a possible ■ second* rv 
disease" related to E3V 

The other disease where E3V shows a loo % association 
is nasopharyngeal carcinoma ( N PC) ("The Biology of 

15 Nasopharyngeal Carcinoma", CJICC technical report series, 
vol". 71, edited by M .J. Simons and K. Shanmuoaratnam, 
International Onion Against Cancer, Geneva, p. 1 (1982)) 
XPC most frequently starts at the fossa of Rosenmuell- 
(Recessus pharyngeus) at the postnasal space. Presently 

20_ patients are hospitalized only after the first tvoicai 
metastases have developed m the cervical Ivmoh" nodes. 
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In some areas of Southern China and amongst Chinese « a 
Singapore and Malaysia, NPC is the most frequent neoplasia 

or man with an incidence of up to *o ner in„ „„„ 

uu i_u >io per ioo,ooo oer vear 

In other parts of the world, like 3omeo or Tunesia the 
incidence is also high, m mosr other areas, the irc'd-ac- 
is around o,2 per loo,ooo per year which represents about 
30 4 %_ or ear, nose and throat (ENT) -tumors . The age dis- 
tribution shows a clear sinole -, Pa i, -^„„„- 

v-j.=_i singxe peaK arounc che age of Jo 

to 5o in almost ail high-risk areas, in Borneo and to some 
extent in Tunesia, a remarkable second peak has, however 

° ee:l . ° faserved at « early age ranging from 5 to 15 vears 
35 (M. J. Simons et al., supra). 
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Environmental factors including traditional Chinese 
medicine mav be resoonsible . f or the increased risk of 
nasopharyngeal carcinoma in certain, predominantly 
Chinese, ooouiations of Southern Asia (H . Wolf, "3ioioav 
of Epstein-3arr virus in: "Immune deficiency and cancer: 
Epstain-3arr virus and iymphoproiif erative malignancies 1 ' , 
ed. D. Purtilo, Plenum Press, p. 233 (1984)). 

Concrol of E3V-reIated neoolasia 



There are three possible basic strategies- to control 
neoplasia: 

1. Early detection followed by therapy, 

2. delay of onset of disease ideally beyond the average 
15 lifespan, and 

3. prevention. 

These goals may be achieved also in multifactorial 
diseases such as manv neoDlasias,— Incidence" of. disease 
may be reduced by—eliminating one or more of the essential 
factors which are not necessarily sufficient by them- 
selves to cause the disease, or by reducing factors which 
promote the manifestation of neoplastic .conditions . 
The use of the specific virus-related antigens of this 
invention, or antibodies or genetic materials as tools, 
for early diagnosis of virus-related tumors, might 
facilitate the elimination of essential factors. 



Selection of EBV-related gene products for diagnosis of 
E3V-related NPC 

A. Primary infection with EBV: Development of antibodies 
against VCA (viral capsid antigen) / EA (early- antigen) 
and EBNA (Eps tein-3arr Muclear Antigen) 



0173254 



8 

EBV infects B-lymphocytes during acute or primary in- 

■ 

fection (mononucleosis). Due to the laclc of immune re£ 
ponse, a number of cells enter into the lytic cycle 
and produce a full set of viral antigens which are shs 
into the blood stream during cytolysis. Against these 
antigens, specific antibodies will" be synthesized by 
the host's immune system (Table A) • 

Probably not all B-lymphocytes are capable of support: 
a fully lytic infection due to a cellular factor whic: 
prevents exDression of EBV. These cells are latentiv 
carrying EBV genomes for the rest of the host's life. 

Table A 



Disease 


VCA: 


IGG 


I CM 


IGA 


EA 


EBNA 


ma' 


NORMAL ADULTS 


+ 






* 


+ 




Acute adults 














(EARLY) 






+ 
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Chronic infection 


+ 
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Reactivation 


+ 
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XLP a 
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• 


NPC 
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+<D) 




++ i 


BL 




_ . ++ 








: 1 

! - ! 
t i 



1 XLP AS AN EXAMPLE OF IMMUNOLOGICALLY DEPRIVED HOSTS 

— . ... 'determined 3y immunoprecipltai.ion-OF._SP.. 2ftG/2CC 
(MA; membrane antigen) 

■ -- - - - - - — - 

B. Convalescence: Disappearance of antibodies against 
EA and maintenance of antibodies against VCA and I3NA 

As the immune defense mechanisms of the body remove th 
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lytically infected" cells from the circulation, the anti- 
body levels will start" to fall during the convalescent 
phase. After a certain period, anti-EA-antibodies dis- 
appear. However, as mentioned above7~EBV"is produced in 
the parotid gland. The viral particles and intracellular 
virus-associated antigens including EA will be shed into 
the saliva and reach the oropharynx. Here the viral 
particles bind to the B -lymphocytes and are presented 
to the body as antigens , thus the antibody titer against VCA 
is. maintained. Since EA cannot bincL to the. lymphocytes 
it will be degraded by poteases and therefore will not 
be. available, to the immune system, as., an:, antibody- inducing 
antigen. - 

The circulating lymphocytes -that are~latently- infected 
by EBV contain^ EBNA. At the end of their, life cycle these 
cells disintegrate and release *EBNA into the. blood stream. 
Therefore antibodies to this antigen will, persist . 

Thus , due to the EBV-production in the parotid gland and 
to. the release of EBNA from latently infected. B-cells, 
sera of convalescents will have iow an ti -VCA. and anti- 
EBNA IgG-antibody. levels (see Table. A, supra) ... . 
In addition- EA released frcm rare B - lymphocytes- which may enter a - ■. 
lytic cycle may. be. an inferior antigen and may not give rise to anti- 
body levels detectable with the test systems, used 

In combination with "the "known sequence of appearance, of . 
antibody classes 7 "specif ically the early "presence "of Ig~M 
antibodies followed by IgG antibodies, the various anti- 
"gen. classes of primary disease "caused "by EBVcan'^be *_ 
. utilized " for "improved diagnosti'd~p'rocedufes . However, 
available test systems which are mainly based on cellular 
antigens or cell derived antigens have serious limitations, 
■rnis concerns tne sensitivity, especially for detection of 
IgM antibodies and also unspecific reactions. 
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C. EBV-related a ntibodies in individuals suff ering of NPC 

The first suggestive evidence that E?stein-3arr virus 
might be causally related to nasopharyngeal carcinoma 
and African Burkitt's Lymphoma was derived from sero- 
logical data (for review see M. A. Epstein, a.G. Achong , 
"The E?stein-3arr Virus" Springer Veriag Berlin, Heidel 
berg, New York (1979)). 

Using mainly indirect immunofluorescence on calls pro- 
ducing virus or at least early viral antigens, signi- 
ficantly higher anribody titers to these antigens" 
were found inpatients' sera. These first tests which 
detected unspecified immunoglobulin classes against 
a group of proteins named Early Antigen (EA) and an- 
other group of proteins named Virus Capsid Antigens 
(VCA) were helpful for the establishment of a re- 
lationship between EBV and these diseases. These tests, 
however, are of limited value for definite diagnosis 
of the malignancies from a single serum, and cannot 
... be used for monitoring therapv. 

The introduction of antigen and antibody class specific 
tests, specifically the determination of peripheral IgA 
antibodies for the two antigen families EA and VCA and 
also the first attempts to subdivide at least the EA- 
faraily (EA, D or R; G. Henle, w. Henle and G. Klein, 
"Demonstration of 2 distinct components in the early anti- 
gen complex of Epstain-3arr Virus infected cells" , Int. 
J. Cancer 3, p, 272- (1971)) achieved remarkable improve- 
ments of the diagnostic and prognostic value of the tests. 

In the areas of high risk for NPC, 1% of the adult popu- 
lation has IgA antibodies for ESV-Capsid antigen (VCA) . 
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Three percent of this group has NPC upon clinical examination 
and, with the exception of terminal cases, there were no 
anti-VCA IgA negative cases detected. Out of the IgA 
anti-VCA positives, about 1% per year developed NPC in 
a 3 year follow up. A test of this quality, if available 
as a highly specific automat- readable ELISA test, would 
provide an excellent "first step" screening for a popu- 
lation of extreme risk. 



Detection of EB virus IgA/ VGA antibody is helpful for 
diagnosis of NPC (see table on page 14) , and of special value for the 
detection of early stages. For example, in Wuzhou City (China; 
high risk area for NPC) , the frequency of NPC detected 
by ..serological mass, survey revealed a much higher, per- 
15 centage of patients in stages I (42%) and II. (48%) than 
otherwise detected in outpatient clinics (1.7% stage I 
and 3o % stage II) . The chance of survival is clearly 
related to the stage at which- therapy is begun. The sur- 
vival, rates for" stage I are (according to Shanghai Tumor 
Hospital) 93%, for stage II 75%, and are. very low for 
more advanced stages. Therefore it is possible to reduce 
the mortality rate of NPC through early detection and 
early treatment. 

IgA antibodies to the early antigen complex of HBV can be 
detected in 40* % to 70 % of NPC patients, depending on the 
' method 'used. These antibodies are virtually absent in the 
--non-tumorbearing population. Such test of- the tumorbearing 
- -individuals should be of great importance for the "decision 
„ to start therapy, and its value would be even higher if the 

O w 

sensitivity could be enhanced to allow detection of disease 
...in. closer to 100 % of the tumor patients. 

„»T»-*The detection . rate • of NPC among IgA/VCA antibody-positive 
individuals is 1.9. % and that, of IgA/EA individuals is 

o o 

30-40 %. These data indicate that the IgA/EA antibody 
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test is more specific for the detection of NPC, but not 
sensitive as IgA/VCA antibody. 

A number of laboratories have used the continuous deter- 
mination of IgA antibodies to EA and VGA to monitor the 
success of theraDv and for eariv defection of reiaDse 
with very good success . 



Membrane protein gp 250/350 and its use 



Four proteins of the viral envelope constituting the so- 
called membrane antigen complex (MA) have been described 

w Pearson, et. al., supra; J. North, 

Mor ? an ' M.A. Epstein, "Observations on the E3 virus 
■envelope and virus -determined membrane antigen (MA) oolv- 
peptides", inc. j. Cancer 26, p. 231 ( ! 980 , , . ^ of ^ 

proteins, i.e. op 250 and on -jen- • . 

yp J50 are antigen*cally closely re- 

-atea (D.A,- Thorley-Lawson and X. Geiiinaer, -Mono- 
clonal antibodies against the major glvcoorotein (gp 
250/350) of E?stein-3arr virus neutralize inf ectivltv" , 
?roc. Natl. Acad. Sci, USA- 77, -p. - 5307 (1980) )": The 
molecular weight of one component ranges from 200,000 
to 250,000 D depending on the cell line where the virus 
is derived from and the second antigenetically related 
glycoprocein has a molecular- weight or 300,000-350,000 D 
but is absent in some cell lines.. Since these glyco- 
proteins are ail related, in antigenicity, procein and 
encoding DNA. sequence , they are. usually referred to 
as gp 220/350 or gp 2SO/3SO or simply as gp 250 or gp 350 
but meaning the whole family of related glycoproteins. 

Glycoprotein 250/350 is able...ta;bind- to the-SSW receotor 
of human and some primace 3-lymphocytes and- to thus 
initiate the infection of these cells (A. Wells, N. Xoide , 
G. Klein, "Two large virion envelope givcooroteins mediate 
E?stein-3arr virus binding to receptor-positive ceils", 



j;-Virol-;--4lV-p.--286"-(1982) ) .[-Antibodies against these 
proteins neutralize the infectivity of the virus, which 
could be demonstrated for human as well as for rabbit 
antisera and mouse monoclonal antibodies (D.A. Thorley- 
Lawson et. al., supra). By the use of monoclonal anti- 
bodies it has been shown that blocking of only one anti- 
genic determinant present both in gp 350 and gp 250 was 
sufficient for virus neutralization. Adsorption of human 
sera to immobilized gp 350 and gp 250 removed the 
neutralizing antibodies (D.A. Thorley-Lawson et. al . , 
supra) . Thus, there is convincing evidence that 

a) gp 350 and gp 250 induce the production of neutra- 
lizing antibodies, and that . 

b) antibodies against gp 350 and gp 250 have neutralizing 
capacity. 

Therefore, this protein- as- well as its related viral gene 
product,' gp 350 (with a molecular weight of 350 ,000), are 
candidates for a possible EBV vaccine (A.J. Morgan, M.A. 
Epstein, J.E. North, "Comparative immunogenic ity studies on 

Epstein-Barn virus' membrane antigen (MA) gp 34o with 
novel adjuvan ts in mice , . rabbits . and cotton-top taraarins" , 
J. Med. Virol.. 13, p. 281 (1984)). These glycoproteins 
are expressed on induced EBV producer cell lines and 
can be easily demonstrated after radioiodination of 
cell surface proteins (L.F. Qualtiere, G.R. Pearson, . 
"Epstein-Barr. virus-induced membrane antigens: immuno- 
chemical characterization of Triton X-ioo solubilized ■ 

viral membrane antigens from EBV super infected Raji cells", 
Int. J. Cancer 23," p. 808 (T979)). 
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Application of gp 25o/35o for the diagnosis of EBV-related 
diseases 

* 

IgG antibodies are absent during the acute phase of primary 
EBV infection, but present for lifetime after convalescence. 
IgM antibodies are present in the early stage of the 
disease and absent during convalescence. 
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IgA antibodies against EBV-antigens are present almost 
exclusively in NPC patients and can be detected in sera 
of .at least 58 % of these patients even: with not very 
sensitive tests (Zeng Yi and Hans Wolf, manuscript in pre- 
paration and example 16, infra). 

Comparison of Positive Rate of IgG and IijA Antibodies 
to VCA and MA from NPC Patients and Normal Individuals 



Cases 



MA/ IgG: 



MA/lgA 



No. 



rate% 



YCA/IgA 



(+) (+) ( + ) . ( + ) 
No. ratoYo No. rate* 



— ♦ • / T . - a 
•* ** / 1 ? ' * * 



(♦) 



1 1 >-> 



Patients 



48 



d.3 



100 



28 58.3' 



48 



100 



Of . o 



Normal 
Indivi- 
duals 



48 



47 



97.9 



0 



MA /IgG 2-nd !*A/IgA detected by immunofluorescence test 
VCk/I<rk and SA/IgA detected by irnraunoenzymatic test 



The whole gp 25o molecule or parts of its backbone poly- 
peptide chain can be utilized as reagents in preferentially 
35 class-specific antibody detection tests such as passive 
hemagglutination, counter gel electrophoresis, radio- 
immunoassays or enzyme-linked immuno-absorbent assays. 
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j- - Highly - specif ic. test antigens allow better signals and 
. detect otherwise unrevealed low antibody levels of 
. clinical significance. The use of singular antigenic 
sites of the gp 25o instead of the entire gene product 
5 may, in. some cases, permit a more precise, diagnosis 
of the disease. 

Application of gp 25o/35o for prop hylaxis and treatment 
of EBV-related disease s 

10 

A. Since infection by EBV early in life only causes sub- 
clinical seroconversion, it may be anticipated that 
the presence of. maternal antibodies or antibodies 
induced by a vaccine will influence the clinical 

15 manifestation of a primary EBV infection. It is 

expected that the vaccination of children or young 
adults, preferably before the peak of risk of catching 
an EBV infection, reduces effectively the clinical 
manifestation of infectious mononucleosis in the 

20 ' population.' 

B. in all areas with high incidence rates for NPC or BL, 
the population shows almost 1oo % seroconversion to 
EBV within the first one to two years of life. 

25 Vaccination will have to take place soon after birth. 

If this vaccination is regularly repeated, it will 
in all probability prevent EBV infection, delay it 
or reduce the biological effects of early primary 
infection. Each of these consequences is expected 

30 ' " to" either' prevent " the subsequent* development of 

" neoplasia, to delay its onset considerably or to de- 
crease the relative risk. 

35 



L 



J 
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1 C. In NPC, occasional production of viral antigens at 
the site of the tumor will stimulate primarily IgA 
- secreting B-lymphocytes. IgA antibodies are capable 
of blocking antibody-mediated cytotoxicity. IgA anti- 
5 bodies to viral membrane antigens, such as gp 25o, 
are present in NPC and BL patients and may not only 
be indicators of the disease, but may even contribute 
to the failure of the immune system to eliminate the 
tumor cells by their masking potential. Large doses 
of the purified antigen given to tumor patients may 
bind IgA and initiate the formation of an excess of 
IgG antibodies directed to the same antigen. These 
specific IgG antibodies may then compete with 
remaining IgA antibodies and allow the elimination 
15 of tumor cells by antibody-dependent* mechanisms . 

D. Appropriate administration of gp 25o or related pro- 
ducts might also enhance the cellular immune mechanisms 
and thus restrict the growth of tumors. 
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Production of EBV specific antigens according to the 
present invention 

1 . As a consequence of all findings, it is one of the objects of this 
invention to, improve the sensitivity of tests for detection of anti- 
body classes and antigen specific antibodies and to develop a system 
which allows mass testing and better standardization. 

2. EBV.. cannot be efficiently produced in a lytic cell 
.cycle since efficiently infectable. cells are not 

known at present and because all of the cells used 
as source for the preparation of EBV or related anti- 
gens are. immortalized cells or even tumor derived 
cells. In most cell lines retroviruses have been 
demonstrated. The products isolated from such cultures 
therefore are not only very expensive but their use 
is also a potential safety risk. 
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3. The application of recombinant DNA technology has 

- made -possible - the production" of useful" polypeptides 
by appropriate host cells transformed with recombinant 
DNA molecules and grown in appropriate culture" systems . 

4. According to the present invention, recombinant DNA 
methods are used to express the genetic information 

of the genes or at least of parts of the genes encoding the EBV pro- 
teins p138, p150 and gp 250/350 in appropriate host cells, such as 
bacteria (e.g. the genera Escherichia, Salmonella, Pseudcmonas 
or Bacillus) ,. yeasts (e.g. the genera Candida, or 
Saccharomyces) and mammalian cells (e.g. Vero-cells, 
CHO-cells or lymphoblastoid cell lines). * 

5 . Furthermore, the genomic regions encoding -the EBV proteins 
p150, pi 43, p138, p110, p105, p9Q, p80 and p54 ware identified and their 
relevance for diagnostic purpose has been identified. Therefore, the 
key information for the production of these proteins : ~or antigenic deter- 
minants thereof in a. manner as demonstratad for the proteins p138, p150 

and gp 250/350 is also, disclosed -in.. the present invention. 



Recombinant DNA technology -■- 

A. Expression control systems - 

Prokaryotes most frequently ■ are represented*- by- various 
strains of E. coli. However, other microbial ■ strains 
may also be used*, such~as' bacilli, ' for example- 
Bacillus- subtilis , -various- species- of -Pseudomohas , 
or other bacterial ' strains . ~In such pfok'aryotic ' ■" 
systems /'plasmid vectors which contain replication 
sites and control sequences derived from 1 . a ."species 
compatible with the host "are used/~For/'examplef ;E. coli 
is typically transformed * using ' derivatives of *pBR322, 
a piasmic derived from an E .' coli " species "by * Bolivar , 
et al., Gene 2, p. 95 (1977). pBR322 contains ' "genes for 
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arapicillin and tetracycline resistance, and these markers 
can be either retained or destroyed in constructing the 
desired vector. Commonly used prokaryotic control 
sequences which are defined herein to include pro- 
moters for transcription initation, optionally with an 
operator , along with ribosome binding site sequences, 
include such commonly used promoters as the beta- 
lactamase (penicillinase) and lactose (lac) promoter 
systems (Chang, et. al. Nature 198, p. 1056 (1977)) 
and the tryptophan ( trp) promoter system (Goeddei, et 
al- , Nucleic Acids Res, 8, p. 4057 (1980)). The lambda- 
derived promoter and N-gene ribosome binding site 
(Shimatake, et al., Nature 292, p. 128 (1981), which 
has been made useful as a portable control cassette 
are further examples. However, any available promoter 
system compatible with prokaryotes can be used. 

In addition to bacteria eukaryotic microbes, such 
as yeast,, may also be used as hosts. Laboratory strains 
of Saccharomyces cerevisiae, 3aker's yeast, are most 
used, although a number of other strains are commonly 
available. While vectors employing the 2 micron origin 
of replication are illustrated (J.R. Broach , Meth. 
Enz. 101, p. 307 (1983)), other plasmid vectors suit- 
able for yeast expression are known ( see , for example, 
Stinchcomb et al.,. Nature 282, p. 39.. (1979); Tschesipe 
at al., Gene 10, p. 157 (1980) and L. Clarke et: al. , 
Meth. Enz. 101, o. 300 (1983)). Control secuences for 

yeast vectors include promoters for the synthesis of 
glycolytic enzymes (Hess et. al., J- Adv. Enzyme Reg. 7, 
p. 149 (1963); Holland et al., 3iochemistry 17, p. 4900 
(.1978)). Additional promoters known in the art include 
the promoter for 3-phosphoglycerate kinase (Hitzeman 
et al., J. Biol. Chem. 255, ?. 2073 (1980)), and those 
for other glycolytic enzymes, such as giyceraldehyde- 
3-phosphate dehydrogenase, hexokinase, pyruvate decarhoxy 
iase , phosphor ruccokinase , glucosa-6-phosphate iso- 
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me rase , 3-phosphoglycerata mutase, pyruvate kinase, 
tr iosephosphate isomerase-, phosphogiucose isomerase, 
and ciucokinasa. Other promoters, which have the 
additional advantage of transcription controlled by 
growth, conditions are the promoter regions for alcohol 
dehydrogenase 2, isocytochrome C, acid phosphatase, 
decrradative enzymes associated with nitrogen metabolism, 
and enzymes responsible for maltose and galactose 
utilization (Holland, supra) . 



Evidence suggests that terminator sequences are de- 
sirable at the 3 ' end of the coding sequences. Such 
terminators are found in the- 3 1 untranslated region 
following the coding sequences in yeast-derived genes. 

15 Hanv of the vectors illustrated contain control, sequences 
derived from the enolase-I gene containing plasmid 
peno46 (M.J. Holland at aL, J. Biol.* Chem. 256 , p. 1385 
(1981)) or the LZU 2 gene obtained from YEpT3 (J. S roach 
2- al . , Gene 8, -p.- 1 21 (-19 79)), however, any vector 

20 containing a yeast- compatible promoter, origin of 

replication and other control sequences is suitable. 

It is also, of course, possible to express genes en- 
coding polypeptides in eukaryotic host ceil cultures 
25 derived from multicellular organisms. See, for example, 
Cruz and Patterson, editors, "Tissue Cultures" , Academic 
?rsss ( 1973). Useful host cell lines include. VERO and 
He La ceils, and Chinese hamster ovary (CHO) calls. 
Expression vectors for such calls ordinarily include 
30 promoters and ccncrol sequences compatible with mammalian 
calls . such as, for example, the commonly used early 
and lata oromoters from Simian Virus 40 (SV 40) (Fiers 
at ai . , Mature 173., p . _ 1 1 3 ( J 9 78 ) ) , or other viral 
promoters such as those derived from polyoma. Adeno- 
35 virus 2, bovine papiioma virus, or avian sarcoma virusses. 
General asoeczs of mammalian call host svstam transfor- 



options have been described by Axel in Q. 3'. Patent: Mo. 
4,399,216 issued August- 16 , -1983 It now; appears - 
also that "enhancer" regions are important in optimizing 
expression; these are, generally, sequences found frequently up- 
stream o£ the Promoter region. Origins of replication 
3iav be obtained, if needed, from viral sources. However, 
aene in t aeration into the chromosome is a ccmmcn 
mechanism for DNA replication, ir. aukaryctes , and hence 
independently replicating- vectors are not required. 

Plant ceils are also now available as hosts , and 
control secuences contDatibie with, olant ceils such 
as- the ncpaline synthase promoter and poiyadenyiation 
signal sequences (A. Depicicer etr. al . , J. Moi. Appi. Gen. 1 
p. 561 (1982)) are available. - 

Transformation of suitable hosts 

Deoendinc on- the ho st ceil used, transformation is dene 
us incr standard techniaues accropriate to such calls. 
The calcium treatment employing calcium chloride, as 
described by S.N. Cohen, ?rcc. Natl. Acad. Sci. (USA) 59, 
o . 2110 (1972) is used for crokaryoces or other ceils 
which contain substantial ceil wail barriers. Infection 
with Acrcbacterium tumef aciens (C.H~. Shaw et ai . , 
Gene 23, p. 215 (1982)) is used for certain plant ceils. 
For mammalian ceils without, such ceil wails, the 
calcium ohosphate precipitation method of Graham 
and van der Z'o (Virology 52, p. 546 (1978)) 'is pre- 
ferred. Transformations into yeast are carried "out 
accordinc to" the 'method of ?. van Soiingen et ai. 
JJ. 3act. 1 30 ; p. "946" (1977)) and C. L. Hsiao et ai. 
(Prcc. iTatll. Acad. 5ci. (USA) 76, p. 2829 "( 1 979 )) . 
Alternatively, the procedure of ' Kiehe~,~ et aiV-(^ene 25 r - 
p. 323 (1983)) can be used/ 
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1 C. Construction of recombinant cloning and expression 
vectors . . ■ _ . ■ 

Construction of suitable sectors containing ...the .desired 
coding and control sequences employs standard ligation 
and restriction techniques which are well understood 
in the art. Isolated piasmids, DMA sequences, or. 
.synthesized oligodeoxyribonucleotides , are- cleaved, 
tailored and reiigated in the form desired. 
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Site specific DNA. cleavage is performed by treating 
with .the suitable .restriction -enzyme ~ (or .enzymes) 
under conditions which. are generally .understood. in 
the art, and the particulars of which are specified 
by the manufacturer of these commercially available 
restriction enzymes. See, e.g., New England Biolabs, 
Product Catalog. 

If desired, - size separation of the cleaved fragments 
may be performed by polyacrylamide gel or agarose gel 
electrophoresis using standard techniques. A general 
description of size separations is found Jin "Methods 
in Enzymology". 65, p.. 499-560 (1980).., 

* * ^m. » — — « » » - . * * » . 

* F ♦ . » * * * ■ * . * 1 . - ' 

Restriction cleaved fragments may be blunt ended 

by treating with the. .'.large "fragment of E7"coii .DNA 
polymerase. I (Klenow) _ in. the presence of the four 

deoxynucleotide. triphosphates (cNT?s) . The Klehbw 

fragment- fills, in at_5 '..sticky ends but chews back 

protruding- 3* single, strands , even though the" four 

dNTPs. are present... If. desired, selective : rVp~air" 

can. be performed bv.~suoolyina onlv a selected one 

.or_mQce,dNT?s within, the- limitations. -dicta ted by 'the . 

nature of the sticky ends. Treatment under 'appropriate 

conditions with" sV nuclease results in- hydrolysis of : 

any" single-stranded - ' portion .~ " " : - • . : : : 
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Synthetic , oligonucleotides may be prepared by the 
triester method of Matteucci et al. (J, Am, Chem. Soc. 103, 
p. 3185 (1981 J) or the diethylphosphoramidite method 
of Caruthers, described in CJ.S. Patent No, 4,415, 732, 
issued November 15, 1983. 

Ligations are performed under standard conditions and 
temperatures (as described below) using T4 DNA ligase. 
In vector constructions emolovinc "vector fraaments", 
the vector fragment is commonly treated with bacterial 
alkaline phosphatase (BAP) in order to remove the 5' 
phosphate and prevent religation of the vector. 3AP 
digestions are carried out under standard conditions 
(as described below) . 

Selection of transf ormants 



In the constructions correct ligations for piasmid 
"construction are confirmed bv transf ormina E. coli 
or other suitable hosts with the liaaticn mixture. 
Successful transf ormants are selected by ampiciliin, 
-tetracycline or other antibiotic resistance or 
using other markers depending on the mode of piasmid 
construction, as is understood in the art. 

Brief summary of the invention 

The present invention relates to the production of EBV 
specific* antigens by recombinant DNA technology and their 
use in diagnosis, prophylaxis and therapy of EBV-related 
dieseases. Therefore, it is an object of this invention 
to identify novel Epstein-Barr viral antigens, such as p150^ 
P 143;"pi38, P110, p105, P 90, P 80, p54(G.J. Sayliss, H. Wolf 
infra)",' which are "correlated with Epstein-3arr " virus related 

diseases like nasopharyngeal carcinoma (NPC) , infectious 
mononucleosis, and 3ur!<itt's lymphoma (see legend to 
Fia. 1 and Fia. 28) bv immunolocrical methods. 
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Another object of this invention is the localization and identif ica- 
tion of gencmic regions of EBV, for example : as it has been cloned 
frcm B95-8 cells (American* Type Culture Collection, Rockville, 
Maryland, USA (ATCC) CRL1612) (J. Skare, J.L. Stxcrainger, "Cloning 
and napping of BamHI endonuc lease fragments of the DNA frcm the 
transforming B95-8 strain of Epstein-Barr Virus", Proc. Natl. Acad. 
Sci. USA 77, p3860(1980)) coding for said antigens* of diagnostic im- 
portance' and of relevance for medical purposes . This is 
achieved by using the hybrid selection method. 

A further object of the present invention . is the sub- 
cloning of "a genomic region of "EBV, for example from 
existing libraries of EBV, cloned from 395-8 cells which 
concedes at least a part of useful antigens for 

inadical purposes, such- "as p138 and p150. This is achieved by joining a 
subgenccnic fragment, e .g". "aiXhbl- fragment, "derived* from the 
EBV B9 5-8 subclone pBR322 BamA, (J.Skare et al.., supraj to the plas- 
mid pUC8 (J. Massing; infra) ( pUC635 , see Fig. . 4). 

20 Another. objecf of this invention is the "production of 
proteins by expression of the respective genetic 

information .in suitable host cells, such as bacteria 
(e.g. of. the- genera Escherichia, Salmonella, Pseudomonas or 
3acillus) , yeasts (e.<j. o_f the genera Candida or Saccharonyces) animal 
cells and human- cells { e .q. Vero-cells; CHO-cells; CHO dhfr*" 
cells in combination with an appropriate selection system,, 
optionally a plasmid carrying a functional dhfr gene as well 

♦ -». » 

as the genetic information for the EBV gene ' under control 
of a suitable regulation sequence; or lymphoblastoid cell 
lines). The proteins produced by these host cells contain 
e.g. p138, pl50 or gp250/350-related antigenic determinants 

.. . _ Jepitopes) and are,' depending on the expression system, 
^^SYDthesized either. as a fusion protein or as a. non-fusion 
protain. 

35 • • ... 
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For the production of a fusion protein by bacteria 
the expression of the genomic subf ragments, f or example 
that encoding a part of p138 of EBV B95-8 and introduced into 
the kncwn plasmid P UC8, was induced e.g. by. isopropyl-5-^tliioglacto- 
% ^ { P G) • The respec tive expression products 

were identified by immunological methods. 

Another fusion protein is provided bv cleavinc subclone 
pOC635 with EcoRI and Bglll and introducing this frag- 
ment into the vector plasmid pUC9 (U. Riither, infra) . The 

resulting recombinant plasmid is pj3C924 (Fig. 6) . The expression 
product has a size of about 94 kd. 

A further fusion protein is produced by expressing the 
genetic information of said XhoI-p1 38 -encoding fragment 
of p8R322 BamA in the plasmid pEA305 (E „ Amarm, J. Brosius, 
M. Ptashne, "Vectors bearing a hybrid trp- lac -promoter 
useful for regulated expression of cloned genes in E. coli" , 
Gene 25, p. 167 (1983)). After putting the p138 related 
information into a proper reading frame relative to pEA305, 
the. clone pMF924 synthezises a fusion protein that contains 
a part of the A- -repressor protein (Fig. 7) . 

Still another fusion protein is provided by cloning a 3.0 kb 
genomic Xhol-fragment containing p138-related genetic in- 
formation 3' to a hybrid trp-lac promoter (as described 
by F. Amann et. al., supra) . For this purpose the known plasmid 
OKX240-T1 was used. The resulting clone pKX378 synthezises 
a fusion protein that is composed of an aminoterminal 
methionine residue followed by p138 related DNA sequences 
(Fig. 8) . 

Still another object of the present invention is to oro- 
vide fusion proteins, non-fusion proteins or oligopeptides 
v^iich contain only the antigenic determinant protein sub- 
regions of viral proteins like p138. For this purpose 
the determinants of the protein are located by a computer- 
directed analysis, using a computer program developed by 



017325 



-.4 



10 



15 



20 



25 



.30 



- 25 - 

us for. the Digital Equipment VAX 1 1/750 computer. A similar 
program has been used for other problems and another com- 
puter by G.H. Cohen, 3. Dietzschold , M. Ponce de Leon," 
D.. Long, E. Golub, A. Varrichio. L. Pereira, R.J. Eisenberc, 
"Localization and synthesis of an antigenic determinant of 
Herpes, simplex virus glycoprotein D that stimulates the 
production of neutralizing antibody", J. Virol 49, p. 102 

(1984) . By cloning the respective fragments into vectors 
like pUC8 or pUR288 (U. Ruther et al. , infra) plasmids 
as pUR600 and pUR540 were obtained. The produced large 
and small fusion proteins are investigated by gel 
electrophoresis and immunob lotting, experiments* The clo- 
ning experiments in pUR288 were done for stabilizing 
the small p138-related polypeptides. 

A further object of the present invention is the expression 
of polyantigens composed of antigenic determinants of 
several different EBV-serotypes . -For that purpose the 
corresponding- DNi^f ragments are* "linked and introduced 
in a suitable vector." The expression products are fusion 
and non-fusion EBV-specific polyantigens. 

Further fusion proteins containing p150-related antigenic 
- determinants were obtained by cloning and expression of the 

corresponding DNA sequences in pUR plasmids and pUC plasmids. 

The- obtained constructs "were the -recombinant plasmids. 
' pUR290CXH580, pUR290DBX320 , pUR292DBB1 80 , pUR290DTT70 0 , 

pCJRDTT740 , pUR290DTP680 , and pUR288DPP3 20 . .-: 



-Anothen..object-ofc r the present invention .is the construction 
°^i neWi expression, vectors, such as. pUCSOO and pUC601 
wh A9^„contain a_ part. o£ the. . coding^ region .of _the .viral 
protein p138- r If DNA sequences* are introduced" 3' "to."th±s 
35 sequence into the vector, the expression products are 
stabilized by the pi 38-specif ic aminoacid sequence and 
protected against protease degradation. 
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1 Still another object of this invention is the modification 
of said expression vectors by introducing a DNA-seguence 
coding for three to fifteen Arginine residues and at 
least one stop codon 3* of the cloning site of said ex- 

5 press ion vectors and furthermore positioned in an 

appropriate reading frame. The obtained vector is pUCARG601. 
If DNA-sequences coding for proteinaceous material are 
inserted into this expression vector the expression 
products will be fusion proteins carrying said Arginine 
10 residues at their carboxy terminus, such as those 
fusion proteins encoded by plasmids pUCABGT140 (see 
Fig. 12a)) and pUCARG680 . 
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15 Thus it is an object of this invention to provide a simple 

method for isolating- proteins useful for diagnosis, prophylaxis and the- 
rapy such as EBV P 1 38 or related polypeptides or oligopeptides 
(antigens) from the host cell lysate according to the 
method of H. M- Sassenfeld, S.J. Brewer ("A polypeptide 
fusion designed for the purification of recombinant pro- 
teins" , Bio/Technology 2, p. 76 (1984) ) . 
By^iiitrociactian of said Arginine residues 

the net charge of the expressed proteins becomes/positive 
and after lysis of the host cells the oligo-arginine 
linked proteins are isolated by a S? Sephadex C-25 column 
chromatography. Due to the oligo-arginine group ^ v 
specific proteins are eluted at a high NaCi-concen-raticn . 
This eluate is then treated with carboxypeptidase 3 which 
degrades carboxy-terminal lysine and arginine residues. 
Finally another S? Sephadex C-25 chromatography is carried 
out wherein the SSV-related proteins are eluted at low 
salt concentrations (see Fig. 16). It is evident, however, 
that this procedure may be used akso for the purifi- 
cation of proteins secreted into the medium. 
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It is also evident that other established methods for 
protein-purification such as molecular sieving or affinity 
chromatography .on ion exchange columns or columns" loaded 
with specific antibodies to the expressed proteins can be 
used as additional or alternate purification methods. 
For the production of non-fusion proteins which essen- 
tially contain amino acid sequences of the naturally 
occurring proteins or parts thereof the recombinant plas- 
mids of the present invention may be modified. If an 
oligonucleotide linker is inserted between the bacterial 
protein encoding region and the EBV-related protein encoding 
region of the expression vector, the amino acid sequence 
corresponding to the oligonucleotide linker becomes part 
ot the expressed fusion protein. After isolation of this 
fusion protein from the transformants expressing it, it is 
cleaved either by amino acid sequence specific proteases 
in the introduced aminoacid linker or, if the amino acid 
linker comprises peptide bonds sensitive to acid cleavage,, by 
treatment with acids, e.g. formic acid. 

A further object of this invention is the cloning of a 
genomic region of EBV coding for at least a part of 
the specific viral antigen gp 250 and. gp 350. This is 
achieved by .joining a subgenomic Pstl-PstI fragment of the 
EBV genome frcm the cell strain 395-8 (ATCC CRL 1612) (R. Baer et al. , 
infra) contained in pBR322 BamL (J. Skare et al, supra), to the plasmid 
pU28 (J. Messing et al. , infra) . The resulting reoembinant plasmid 
is designated as. pUCLPI .9 (see Fig. 19). 

For the production of a fusion- protein by bacteria, a 
genomic subfragment coding f or" a- part of gp' 250" : and gp 350 

***** * »* + 

of EBV B95-8 was cloned into the vector pUR 290 "(U. 
Riither et al. , infra) which carries a region of" the lacZ 
gene coding for the enzyme fl-galactosidase " (pURLPV. 9 , 
see Fig. 20) . The respective expression product was 
purified an< * identified by immunological methods. • 
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Still another object of the present invention is to pro- 
vide fusion proteins or non-fusion proteins which con- 
tain only the antigenic determinant protein subregions 
of gp 250 and gp 3 50. For this purpose the antigenic 
determinants of the proteins were localized by a computer- 
directed analysis using a computer program developed 
by us for the Digital Equipment VAX 11/750 computer. 

The respective DNA- fragments are then cloned in a conventional 
expression vector such as pUR ( B-galactosidase) (U. 

Riither, et al., infra) . Plasmids obtained were e.g. 
PURLEP600 and pURLXP390 (see Fig, 27) . 
Furthermore, the N-terminal antigenic determinant of 
gp250/350 was expressed as a fusion protein in a 
pUC vector ( pqCLEP600 , see Fig. 27) . 

Another fusion protein is provided by cloning a DNA-frag- 
ment coding for the N-terminal antigenic determinant of 
gp 250 and gp 350 into the expression vector p(JCARG601 
mentipned - above . 

A further object of the present invention is the ex- 
pression of polyantigens containing several antigenic 
determinants of gp 250 and gp 3 50 located by said computer 
analysis. For this- purpose the corresponding DNA frag- 
ments are linked and introduced in a suitable vector. 
The expression products are fusion and non-fusion EBV- 
specific polyantigens. 

A final object of the present invention. is the utilization 
of either said EBV-related proteins or subregions thereof or, .if suit- 
able, -EBV-relatei-DNA fragments or clones, for the production 
of diagnostic compositions (kits) useful in clinical diagnosis or 
scien . tLfic , res ? ar ? h - These, tests are based .on- principles as ELISA 
(Enzyme-linked iramuno sorbent assay) , RIA (Radio iitinuno assay) or the 
indirect hemagglutination assay. Furthermore / the E3V- 
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related proteins can be used, e.g. for monitoring* 
vaccination programs, analyzing epidemiological" problems 
"for patients treatment, and for the production 
of vaccines for prophylaxis and therapy of EBV-related 
diseases, such as mononucleosis, Burkitt ' s 'lymphoma and naso- 
pharyngeal carcinoma. Vaccines can be manufactured according 
to conventional methods. Unit doses are filled in vials 
optionally together with a conventional adjuvant "such 
as aluminium hydroxide. Alternatively the product may 
be administered in the form of aggregates with liposomes. 
Patients may be vaccinated with a dose sufficient 

to. stimulate antibody formation and revaccinated after 
one month and after 6 months. 

Finally the proteins are useful for prophylaxis and 
therapy of EBV-related diseases because they-are^ 
able to modulate the immune response in patients 
suffering from diseases such as NPC, chronic infectious 
mononucleosis or EBV-related Burkitt Vs. -lymphoma. 

Brief descript ion of the drawincrs 

Figure 1 ; Autoradiography of an immunbprecipitation of 
ESV-specific sera derived from patients suffering from 
mononucleosis and NPC. " 

The immunoprecipitated 35 S-labelled proteins were se- 
parated by a SDS-polyacryiamide gel electrophoresis 
and an X-ray-film- was exposed -to the.gel^ .. r 

The. sources of the.-dif ferent. sera used for: precipitation 
are given at the- bottom of the- respective- regions^ of • the- • - 
autoradiography. The control, designated "pool", contains 
- a1 . 1 - ?_ f . . the_iimuno^ proteins . 
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It can be taken from the autoradiography that at least 
antibodies to pi 38 , piOS and p30" are present in each of 
the NPC sera and only in some of the other EBV- infection 
specific sera. In analogy, antibodies to p54 are signific 
rcr fresh E3V infection (infectious mononucleosis) as 
compared to convalescent state. Antibodies to ?15o, pi 43 
pllo, p90 are also present in convalescent sera of healt; 
individuals and can serve as markers for immunity or, 
in connection with IgM specific tests, for fresh EBV in- 
fection or, in connection with IgA, for a specific test 
for Z3V-reIated neoplasia (NPC and 3L) . 

Figure 2: Mapping of mKNA's relative to the E3V 39 5-3 
genome . 

The 3amHI restriction sites of the EBV B95-3 genome are 
given at. the bottom of the- figure and the respective 
restriction fragments are designated by upper and lower 
case letters. The mRNA's of the proteins localized by 
hybrid-selection to individual 3amHI restriction fragment 
are indicated by numbers and lines. 

It can be taken from the figure, that the gene of pi 38 
was correlated to the 3amA-f ragment . 

Fiaurs 3: DNA sequence of the leftward reading frame of 
3amA- encodina o138. 

The sequence shown is the respective negative strand. 
The pi 38 encoding region starts at nucleotide position 182 
and ends at nucleotide position 3565. Restriction sites 
used for cloning of fragments of this coding region are 
indicated. - . 



Ficure 4 : Restriction map of the plasmids puC535 and 
?GCS13o. 
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The size of the vector p(JC8 is 2.7 kb. The cloning site 

3' of the lacUVS 6-gaiactosidase promoter and operator 
(PO) conrains EcoRI(E), BamHI(B), Sail (S) ,- -Psti (p) v - 
and Hindlll(H) site;- The B-lactamase gene is- indicated - 
by .\MP. The 3.o kb and 3,3 kb Xhol-fragments of the 
pi 38 coding region are inserted into the Sail site of pCJCS. 
The insertion is indicated by an open bar, - pUC635 contains 
the 3.o kb Xhol-f ragment in a correct reading frame relative 
to the fl-galactosidase gene, whereas pDC613o contains the 

* 

3.3 kb Xhol-f ragment in the opposite orientation* 

Figure 5: . Expression of the proteins encoded by plasmids pUC635, 
pUC924, pMF924, and pKK378. 

4 • * - » ****** • 1 # «^ 

Lane 1 * of the immunostained Western-blot shows the proteins 
isolated from bacteria transformed # with pUC8_and induced 
with IPTG, • ■• ; - -- 

Lane 2: proteins of pUC924 transformed bacteria 
Lane 3: proteins of pKK378 transformed bacteria ; 
Lane 4: proteins of pMF924 transformed bacteria" 
Lane 5: proteins of* pUC635 transformed bacteria 
The size of the fusion protein was estimated to be 

75kD (lane 2), 110 kD (lane 3), 90 kD (lane 4) - 

and 135 kD (lane 5). - ' ~ 

Figure 6: Restricrion map of the plasmid p(JC924 ... 

The size of the vector ?QC9 is 2.7 kb. The cloning site 3-' 
of the lacUVS 3-galactosidase promoter and operator (PO) 
contains "an EcoRI(E), 3amHI (B) r Sall(S), PstI(P) r and 
HindllKHj" site. The" B-iactamase "gene 'is ' indicated by AMP. 
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The 2.6 kb Bglll/EcoRI-f ragment of pUC63 5 is inserted 
between, the BamHI. and EcoRI sites. The abbreviation of 
Bglll is "'Eg". 

Figure 7 : Restriction map of the piasmid pMF9 24. 

The 2,6 kb BamHI /HindIII-f ragment of puC924 was. inserted 
into the BamHI and Hindlll restriction sites of p£A3o5 
which are located 3' of the hybrid trp-iac promoter 
(tac) and the ^minoterminal coding region of c^ (A-repressor 

F igure 8 : Restriction map of the piasmid pKK378. 

The 3-3 kb BamHI /Hindi II -fragment of pUC613o was. inserted 
into the Hindlll-site of the vector pKK24o-11 using a 
345 bp BamHI/Hindlll-fragment of pBR322 as a linker (which 
is indicated by a heavy black line) . Thus the p138 en- 
coding fragment is located 3 1 of the hybrid trp-lac 
promoter (tac) and an ATG start codon. 

Figure 9: Secondary structures of p138. 

Computer plot of Chou-Fasman calculation of the p138 
secondary -structure . Additionally, the hydrophobic 
(closed circles) and hydrophilic (open circles) regions 
are indicated. 

Antigenic sites can be expected in hydrophilic regions 
with a B -turn. This situation is given in the p6oo region 
and at the carboxy- terminus cf the protein. 

The regions subcloned into the vectors pUCS and p(JR288 
are indicated. 



Figure 10 ; Expression products of bacteria transformed 
wi th t he . .. plasmid pUR^carrying.Pstl. fragments of p138... . 

A. A coomassie brilliant blue stained SDS polyacrylamide 
slab gel analysis of lvsates of IPTG induced bacteria 
carrying the- various plasmids.is shown. Fusion proteins 
with molecular weights between 120 and 150 kd are in- 
dicated with a closed circle. Track lM molecular weight 
markers, tracks pUR400-p(JR540 lysates of bacteria 
carrying plasmids containing the regions of pi 38 as 
shown, in Fig. 3 . 

3 . An enzyme-linked immunoassay.; of proteins transferee 
from a gel (similar to that, shown, in panel A) onto 
nitrocellulose paper (Western blot) is shown. 
In this assay a pool of high titered antiserum was 
used and after washing, the bound immunoglobulins 
were visualized by sequential reaction with peroxidase 
coupled to antibodiesr -against. human IgG and diamine- 
benzidine.. .Only fusion proteins from bacteria con- 
taining pCJR600 and p(JR540 show specific reactions. 

Plasmid p(JCS35 (as a positive control) contains 
almost, the.. whole.,o£. pJ38..coding :: .region,.. however the . 

protein is unstable and is rapidly degraded." pUC8 
is the negative control containing the vector plasmid 
free from EBV derived seauences. 
Figure 1.1: Expression .product . of bacteria transf ormed 
with_the pCJC subclones carrying PstI -fragments of p138. 

-An__enzyme-linked immunoassay .of proteins electrophoreticall 
transfered from a.gel.".onto nitrocellulose paper (Western . 
blot) was carried out'.- In*. this assay a. pool of high titered 

■antiserum was .used- and -after, washing / the bound immuno- 
globulins, were -visualized, bv secruential reaction with 
peroxidase- coupled, to antibodies against human IgG and 
diaminobenzidine.. The, fusion protein from bacteria con- 
taining pCJCP600 was stably produced and shows a specific 
antiaenic reaction . 
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Figure 12: Construction scheme for plasmid P UCABG1 1 40 encoding both 
antigenic sites found by expression as fi-gal fusion proteins 

a) The 5' -PstI site in pUC600 was removed by digesting with 
SstI (20bp upstream) and Hindlll followed by the ligation 
with pUC12-SstI/HindIII. From this plasmid the insert was 
removed with EcoRI and PstI and ligated into pUC8-EcoRI. An 
oligonucleotide coding for five arginines and two stop 
codons was inserted into the resulting plasmid pUC601 as 
single-stranded DNA between the 3' -PstI site and Hindlll 
(pUCARG601). In a last step the 540bp PstI fragment encoding 
the second antigenic determinant from the C-terminus of 

pi 38 was inserted by digestion with PstI and ligation. 
The resulting plasmid contained both antigenic sites in 
frame followed by five arginine-residues . It was designated 
as pUCARG1140. 

b) Nucleotide sequence of the oligoarginine linker. The lower 
strand was synthesized and inserted as a single-strand DNA 

via bridge formation between the sticky ends of PstI and Hindlll. 

Figure 1 3 : IPTG-induc ed expression of the olasmids oUCSOO, 
pUC601, pUCARG60T and pUCARG1140 with-pUC8 as a- control 
The upper part shews a Cocmassie-stained SDS-PAGE. The newly 
detected proteins are marked by a black dot. The lower 
part shows the corresponding western blot obtained after 
immunostaining with serum from NPC patients. In comparison 
to pUCP600 the EBV-related protein encoded by pUC601 is 
about 1.5kD smaller due to the lack of 14 aminoacids (6 amino- 
acids encoded by the pUC-polylinker and 8 from the Pstl-SstI 
fragment) . The size of the protein encoded by p(JCARG601 
is further reduced for about 11kD since the read through 
into the lacZ region of pUC is inhibited by stop ccdons pre- 
sent in the inserted oligonucleotide. In pUCARG1140 the size 
increases to about' 4 2 kD due to the insertion of the 
540bp fragment. The protein is stable in bacterial cells. 
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Figure -14 ; ' Distribution and reactivity of the I gG and 
IaA antibodies of individual NPC-sera against the 
two epitopes detected in p138. 

Lysates of, IPTG- induced E.coli cells carrying the indicated 
plasmids were independently separated on a 12% SDS-PAGE 
four-times and the proteins were transferred to Nitro- 
cellulose by Western-blotting, Lanes 1: pUR288 as 
negative control; lanes 2: pUCARG1140 as a positive 
control; lanes 3: pUR540; lanes 4: pUR600. Two indi- 
vidual NPC-sera (no* 352 and 354) were incubated with 
the filters and the bound IgG and IgA antibodies were 
visualized using peroxidase conjugated anti-human IgG and anti-human IgA 
rabbit antibodies. The different locations of the 
proteins in the Western blots, especially of pUCARGIUO, . 
result fronrdifferent electrophoresis times of the 
SDS-PAGEs. 

Whereas in NPC-serum no. 352 the main reaction of the 
IgG and IgA antibodies is -directed against the P540 epitope 
from. the : C-terminus of p138 (see Fig. 9) in serum no. 354 
the main part of the anti-p138 antibodies recognizes the 
P600 epitope (see Fig. 9) . This indicates that both anti- 
genic sites are necessary for detecting anti-p138 anti- 
bodies in sera. 



Ait- 0 



urotein encoded by plasmid pUCARG1140 



Figure 15; ELISA test usin< 
as antigen. 

Row 1 and 3: EBV-negative sera , row 2: NPC pool serum, 
row 4-13: individual NPC sera. The dilutions tested are 
indicated at the bottom; left lane: IgG- right lane: IgA.. 
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Figure 16,: Purification of proteins- carrying oligo-arginine 
groups a t their carboxy-terminus . , _.. r ._- 

A. Sequence of the oligonucleotide . encoding five argine 
residues and two stop codons . A* Hindill-site" at the 
5* -end and a Pstl-site at the 3* -end were generated 
for the insertion of the oligonucleotide into pUC8 . 

B. Purification scheme of insoluble expressed eukaryotic 
proteins carrying said Arg-linker at their carboxy- 
terminus. 



Fig. 1 7 : 



nM . sequence of "^"i a ftward re.ning frane of the Bam L-fra< 
ment encoding <?P 250/350 . 



The coding region for the glycoprotein starts at genomic 
position 92,53 and ends at position 89433. The sequence 
shown is the respective- negative, strand, b«£ni^ 
^Banfil site at position 92703. According^ to the sequence 
numbering- in-this figure the g P 350 ^encoding reg.cn- xs .. 
located between position 556 and 3276. A TATAA-box in 
the region of basepair 520 is marked with .... the 
probable poly-adenylation site at position 3290 -th 
+++ . The splice donor and splice accept sites are -mcxca tea 

j \ / f nT - the acceptor site. 

by )( for -donor and. ).l "v?? ttt - ;v r 

^hydrophobic, region near the carboy terminus 
of the- coding: region: i^ -r^with . .Probably- thxs^ 
aminoacid sequence serves as an anchor sequence for_ 
fixing the protein to the membrane. 
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Fig. 18: 

• • •» 

Restriction map and open reading frames of the Bam L- 
fragment . .... 

A. Restriction map: 

The positions of the restriction enzymes- Bam HI, 
EcoRI, Hindu I and PstI are indicated relative to 
the nucleotide positions of the Bam L-fragment. " 

B. Open reading frames: ... ...... 

The open reading frames of the Bam L-fragment are 
indicated as boxes and given for both polarities 

of the respective DNA sequence. 

Restriction map of the plasmid pCJCLP1.9 """" 

The size of pUC8 is 2.7 kb. The cloning region 3 r 
of the LacUVS B-galactosidase promoter and operator (PO) 
contains an EcoRI(E) , BamHI (B) , Sall(S), Pstl(P), and. 
Hindlll(H) site. The 1.9 kb subfragment of the Bam-L-. 
fragment, indicated by an open bar, was inserted - into 
the PstI site. The reading frame has the same orientation 
as the lacZ-coding part of pUC8 (indicated by a heavy 
black line) . 
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Fig. 20 ; 

Restriction map of plasmid pURLP1. 9 

The vector pUR290 has a length of 5.2 kb and consists 
of the B-lactamase gene (AMP ) and the origin of re- 
plication of pBR322. The B-galactosidase gene is indi- 
cated by a heavy black line, the respective promoter- 
operator region by PO. The restriction enzymes are 
abbreviated as follows: BamHI(B), Clal(C), EcoRI(E), 
HindlIKH), Pstl(P), and Sail (S). 

The 1.9 kb insert of pUCLP1.9 was introduced between the 
BamHI and the Hindi II site. 

Fig. 21 : 

DNA- an d amino acid sequence of the fusion protein encoded 
by plasmid pCJRLP1.9 

bp 4 - 3069: 3-galactosidase 

bp 3070 - 3072: pUR290 linker (given in low letters) 
bp 3073 - 3088: p(JC8 multiple cloning site (BamHI to 

PstI; given in' low letters) 
bp 3089 - 4985: PstI fragment of gp 350 
bp 4986 - 4-994: pUC8 multiple cloning site 

(PstI to Hindlll; given in low letters) 
bp 4 99 5 - END: pBR3 22 sequence. 

Fig. 22: 

Expression of the 6 -gal : gp 350 protein enccced by plasmid pURLPI .9 

Lane 1 and 2 show a coomassie blue stained PAGE of an 
uninduced (lane 1) and an IPTG induced (lane 2) pURL?1.9 
containing clone. 
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Since there are a lot of bands with different molecular 
weights, it seems that the main part of the protein is 
incompletely syn'thesized . 

Lane 3 shows a peroxidase-DAB stained Western blot with 
NPC sera. It is demonstrated that all newly expressed 
proteins are antigenic, except that band according to 
the size of 116 JcD which corresponds to the B-galactosi- 
dase. 

The bacterial background bands are due to the high content 
of antibacterial-antibodies in the serum used. 



Fig. 23: 

Purification of the S-gal : gp 350 fusion protein encoded . 
by plasmid pURIP1.9 

A. Coomassie stained gel; B. Western blot, treated with 
NPC serum 

Lane 1 : Uninduced culture 
Lane 2: IPTG induced culture 

Lane 3: Insoluble proteins ~ of the lysed~"bacteria-, -disolved 
in 8M urea 

Lane 4: B-gal : gp 350 protein containing fractions, pooled 
after Sepharose 2B-C1 chromatography. 



,Fi ? ure 24j Computer-predicted secondary structure ' 
of g P 350 comprising the relative values- of hydrophilicity 
(dark circles) and hydrophobicity (grey circles), m ' 
the scale given only the loop structures can be -seen 
clearly as line turns of 180° 
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Figure 25: Expression of gp350-f ragments as B-gal fusion 
proteins. 

The coomassie blue stained expression products • encoded by plasmids 
pCJRLEP600 and pURLXP390 are shown in the upper part 
(pUR288 as control) . In the lower part the same probes 
are shown after iramuno staining for demonstrating their 
reactivity with EBV-positive sera. 



Figure 26 : Expression of the proteins encoded by pUCLEP600 
and pUCARG1230 and their reactivity against EBV-positive 
sera with pCJC8 as control; upper part: coraassie-stained 
SDS-PAGE, lower part immuno stained westernblot 



Figure 27: Restriction map of the region coding for 
gp 250/350 

The dark bar indicates" the- region coding for gp 250/350. 
Furthermore the restriction enzymes used for subcloning, the splice 
sites, and the inserts of the recombinant expression 
plasmids constructed according to examples 13 and 15-17 
are shown . 

Figure 28: DMA sequence and corresponding aminoacid sequence 
of EBV-related proteins. 

A. Protein p54 

Nucleotide sequence and derived aminoacid sequence 
of protein p54 which is identified in in vitro 
translation as. p47" but correlated with immuno- 

precipitation with monoclonal antibodies 

3. Protein c90 

C . Protein- p-1 43- ■■ 
0. Protein d150 
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Figure 29: Expression of the B-gal::p150 fusion" prof^i nc 



IPTG-induced clones indicated on top were separatecL'.af ter 
lysis in an 10 % SDS-PAGE and the proteins were stained 
with Coomassie-blue. As a control pUR288. was . applied- to 
show the size of the B-galactosidase . All clones produce 
new proteins larger than the control clone and correspon- 
ding to the insert size. 
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Figure 30: Antigeni city of the B-gal::p150 fusion proteins 

The same lysates from clones shown in Fig. 29 were trans- 
ferred to nitrocellulose and EBV-related antigens were 
visualized by immuno staining (see supra) . The clone en- 
coding the N-terminal part reacts strongly. 



Figure 3 1: Map of the p150 encoding region 



The p150 encoding region is. -shown as dark bar. The re- 
20 stricion. sites use<T~f or subcloning'and the "resulting 
pUR-clones are also indicated. 
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Best: mode of carrying out the invention . 
Zxamole 1 

I dentification of an antigen suitable for diagnosis of MPC 

in order to obtain the desired DNA sequences coding for 
E2V-related antigens of diagnostic significance the 
following siratecv was developed: 
T.rnmunooreciDitation of Epstein-3arr viral proteins wi-h 
various sera from, normal adults, patiencs wi^r. rresn 
infectious mononucleosis or nasopharyngeal carcinoma 
was used to identify antigens, which are of relevance for 
the diagnosis of immune status and characteristic a 
particular disease (Figure 1). These antigens have been 
localized on the Sostein-3arr virus genome by hybrid -se- 
lected translation. With the use of sequence daca, these 
genes were subcloned from E3V-DN& and expressed in eu- 
carvotic and crccarvotic ceils. 

It was shown by immunoprecipitation that IA and VCA are 
not sinaie antigens but families of antigens that consist or 
several polypeptides {G.J. 3ayliss, S . Wolf, "The regu- 
lated expression of Eostein-3arr virus. III. Proteins 
specified by E3V during the lytic cycle" , J. Gen. Virol 56, 
p. 1o5 (1981 ) ) . 

For the immunoprecipitation the £3V-procuc ing , ma -positive 
ceil line ?3HR1 , the F3v-pos itive , non-producing Raji ceil 
line and the Z3V-negative cell line 3 JAB were used. When 
the cells reached a density of about io°/mi, chey were 
diluted with an equal volume of fresh medium. For induction 
of Z3V antigens, ? 3 E3.1 cultures were treated with 4c nc/ni 
chorboi-1 2-mvstrate- 1 3-acetace (modified from zur rlauser. 
et al ( H . zur Hausen, F.J. O'Neill, U.K. Freese, Z . Keeker , 
"Persisting oncogenic heroes virus induced bv the turner 
promoter T?A" , Mature 272 , p. 3 7 3* ( 1 973 )} an d 3 31M butyric 
acid immediately after subculture. For the Labelling of 
-he orotains , the ceils were collected by low-speed centri- 
f uaa Lion and re s usee nee c at a density of 2 :< 1o -cs:i$/^i 
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1 . in methionine-free MEM culture medium containing between 
5o and 1oo uCi/ml 35 S-methionine. The cells were incubated 
at 37°C/5 % CO^ for 4 h and subsequently washed with cold 
Hanks' phosphate buffered saline (PBS) and resuspended in 

5 cold IP buffer (1% Triton-X-1oo , o.1% SDS; o.137 MNaCl; 

1 nM CaCl 2 ; 1 mM MgCl 2 ; 1o% glycerol; 2o mM Tris-HCl pH 9.o; 

o,o1 % NafcU; 1u.g/ml pheny Ime thy 1 sulphonyl fluoride) at a 
concentration of 5 x 1o cell/ml. Then the cells were dis- 
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rupted by sonication and incubated on ice for 6o min. The 
extracts clarified by centrif ugation at 1oo,ooo x g for 
3o min at 4°C. 

35 ... 

S-methionine labelled extracts were immuno-precipitated 

exactly as described (G.J, Bayliss, et al., supra). 

The results are shown in Fig. 1 . 



Antibodies to pi 38, p1o5, p90 and p80 are present in each 
of the MPC sera and only in some of the other EBV- infection 
specific sera. In analogy antibodies to p54 (identical 

2Q to-p-58 in"G.J. Bayliss et 'al., supra) are significant for 
fresh EBV- infection (infectious mononucleosis) as compared 
to convalescent state. Antibodies to p150, p143, p110 
are also present in convalescent sera of healthy indi- 
viduals and_ can serve as markers for immunity or, in 

25 connection with IgM specific tests for fresh EBV- infection 
or, in connection with IgA specific tests, for EBV-related 
neoplasia (NPC and BL) . 

The next step was to localize the antigens on the 
30 EBV genome. Therefore RNA was prepared by lysing the 

EBV-producing- cells described above with 4 M guanidine 
* "isothlocyariate arid o.5- M 2-mercaptoethanoi two days- after 

induction (J.M. Chirgwin, A.E. Przybyia, R.J. MacDonald, 

W.J. Rutter, "Isolation of biologically active ribonucleic 
35 acid from sources enriched in ribonuclease" , Biochemistry 

18, p. 5294, (1979)). The lysate was centrif uged for one 
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hour at 20.000 rpm (SW 41, Beckmann) and the supernatant 
layered on top of 2 ml CsCl density 1.8 g/cm 3 . After 
centrifugation for 17 hours at 150.000 g, the RNA pellet 
was extracted with chloroform and precipitated with 
ethanol. 100 ug total cellular RNA was hybridized for 
2.5 hours at 52 °C in 65 I formamide and 0.4 M NaCi to 
16 \xc cloned Z3V-DNA , which was sonicated, denatured 
and sported on small nitrocellulose ■ filters . Bound mRNA 
was eiuted by boiling the filters 90 sec in water. -The RNA 
was translated in vitro with a mRNA dependent rabbit reti- 
culocyte lysate. The translation products were immuno- 
precipitated using 5 nl of a pool of human NPC sera for on 
assay after preincubation with a protein extract from un- 
labelled EBV-negative 3JA-3 cells as previously des- 
cribed (G.J. Bayliss, G. Deby, H. Wolf, "An immunopreci- 
pitation blocking assay for the analysis of EBV induced 
antigens", J. Virol. Methods 7, p. 229 (1983)). The 
immune complexes were bound on protein A-sepharose, washed 
eiuted by boiling the beads in electrophoresis ' sample 
buffer and loaded onto SDS-polyarcrylamid gels. This 
procedure allowed mapping of a number of viral proteins 
(Fig. 2) relative to the EBV B95-3 genome. The localizatio 
of p138, is given in Fig. 2. Using sequence data . 
(R. Baer, A.T. Bankier, M.D. Biggin, P .L.Deininger , P.J. 
Fawell, T. J. Gibson, G. Hatf ull , G.S. Hudson, S.C. Satch- 
weli, C. Seguin, P.S. Tuffuei, 3. Barrel! "DNA-sequence 
and expression of the 395-3 Epstein-3arr virus genome " , 
Nature 31o, p. 2o7 (1984)), appropriate open reading 
frames for pl.38 and p54 were .identified . (Fig .-.„2.L, These., 
open reading frames are completely contained _in the right 
part of the .3amA-f ragment at the right ^end. or the viral 
genome . ........ 
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Example 4 •"- — -■' - 

Purification of the B-gal : pi 38 fusion protein encoded 
by the plasmid p[JC635 ... ~. . 

The clone E.coli K12 JM109 puC 53 5 was grown at 37°C 

in 5oo ml L-broth supplemented with Ampicillin as* described 
above until the OD-^ was o.8. The fusion orotein - svnthesis 
was induced by IPTG (1 mM) and the incubation was continued 
for another 2 h. Then the cells were collected by centri- - 
fugation for 1o min in a GSA "rotor (Sorvall) at S.ooo rpm 
and they were resuspended in 5o ml 2o mM Tris-HCi, pH 7.5. 
For lysating the ceils, EDTA (So mM final concentration) 
and lysozyme (2 mg/ml final concentration) were added and 
this mixture was incubated for 3o min at 37°C. Ia the 
following, the cells were sonicated (Labsonic 151o, Braun) 
twice for 8 min, Triton X-loo was added to a final con- 
centration of 3 % and, . after further incubation' a t 37°C 
for 3o min, insoluble particj.es of the suspension were 
pelleted by centrif ugation (SS 34 rotor (Sorvall) ,"■ 2o min, 
lo.ooo rpm) . The resulting pellet was dissolved in 2o ml 
of an 8 M urea, r 1o mM Tris-Hcl"/~o, 5" %~ B-mercaptoethanor, 
pH 7.5,. solution and recentrif uged as before.' 

Finally 80 mg of the proteins were subjected to a column 
chromatography (Sepharose 23-Ci (Pharmacia) , length-- 80 cm, 
diameter 3 cm) with 3 M- urea,- -10- mM Tr is ,- 0". l-.%-.S-mercapto- 
ethanol, pH 7.5, buffer. 30 \xl of ' each of. the collected 
4 ml samples were analyzed in a 15 % PAGE and the fusion 
protein -containing fractions were pooled. 
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Example 5 

Cloning of p138 subregions coding for antigenic determinants 
identified by computer analysis 

In principle in diagnostic tests only the antigenic de- 
terminant subregions of ■ the antigenic protein are needed. 
Therefore the p138 amino acid sequence was analyzed bv a 
computer programm and the identified subregions of this 
gene were introduced in suitable vectors. The production 
of such small proteins has the advantage that these are 
less vulnerable to rapid changes of antigenicity with 
decreasing length of the product. Furthermore especially 

in conjunction with assays for class specific antibodies 
they wiii be of diagnostic value. 

According to the method of P. Chou and G. Fasman 
("Conformational parameters for aminoacids in <<-helical 
fl-sheet and random coil regions calculated from proteins", 
Biochemistry. 13 , p. 211 (1974)) the calculation of the 

appropriate secondary, structure of a protein caused by 
its aminoacid sequence (primary structure) is possible. 
Superimposed on_the._ suggested structure , the program de- 
termines the relative hydrophilicity and hydrophobicity . 
Both data sets are combined and a computer graphic is 
drawn that shows ^-helical, B-sheet, 3 -turning and randomly 
coiled regions- of_. the secondary- structure . Thereby the 
hydrophilic and hydrophobic regions. are shown as open and 
closed' circles , resoecrivelv . 



An example of such computer graphic is* shown for the 
pi 38 amino acid sequence in Fig. 9. 
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All expressed proteins display almost the expected size, 
but. the . yield varied over a wide range. The proteins enco- 
ded by pCJC635 and pMF924, seem to be more stably express- 
able than the non-fusion proteins from pUC924 and pKK378. 
However, the amount of even the highest expressed protein 
from pUC635 is too low for a large-scale production since 
in the Coomassie-stained gel only a very weak band was 
visible which may be due to the large size of the eukary- 
otic protein. 



Example 3 

Immunol ogical assay of the proteins encoded by pUC635 
PUC924, PMF924 and pKK378 

The host cells transformed with plasmids p(JC635, p(JC924, 
PMF924 and pUK378 were cultivated in L-broth supplemented 
with 50 ng/ml Ampicillin to a cell density of D g0( ;=0.8. 

Then, f or the-induction of the B-galactosidase the lactose 
analogon isopropyl-3-D-thiogalactopyranoside . ( IP.TG ; Sigma ) 
was added (final concentration: 1mM) After a further 
incubation of 1.5 h at. 3.ZiC^_t-5.-ml , of the. culture. -were.. .. 
centrifuged. The bacteria were : re suspended in 2oo nl boiling 
nix (2 % SDS, 5 .% mercaptoethanol ,- 3-% sucrose , -5o mM 
Tris-3C1, pH 7.o) and heated for 1o min at 1oo°C. 
2o ul of the resulting -protein extract were- separated on 
a 12.5 % polyacrylaraide gel and. finally the ..proteins were 
visualized by coomassie-blue^ staining, but. : since : the 
yield of the expression product- is very low, an T immuno- 
staining was necessary. Therefore the electrophoretically 
separated proteins were transferred to a nitrocellulose 
filter, i.e. a "Western-blot^, was prepared (J. Renart, 
J. Reiser, G.R. Shark "Transfer of proteins from gels 
to diazobenzyl -methyl-paper and detection with' antisera" , 
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. - c^n 76 o 31 16 (1979), S. Modrow , 

1 9r OC . Matl. Acac. Sci. USA /b, p- 

. . „ h^nes virus saimir i ana 
s Wol-, "Characterization or ne^pesvixu* 

ateies induced proteins", in: Latent Herpes 
infections in Veterinary Medicine, Martinus Hijnor: .unl., 
5 a 1o= (1984) ) . 

m 

r 

The Western-blo. was prepared with a current intensity 
of o.3 A for 3 h in Western-blot buffer (72 g glycin, 

i a n iP5t ad 5 1) . Then the 
« c ^ t-t-hs 1 1 methanol, H 9 0 aesw. 

i r> ..(-,«fed wi*h Cohen buffer for 3 h 

10 nitrocellulose was saturated wi.a 

,o 1 % Ficoll 4oo, 1 % polyvinylpyrroiicone, 1.6 % SSA. 
o ; \ \ 4o, o.oS % gelatine,. o.17 O3BO3, 28 SaOH 
Via rt"ad. 6 mM N a« 3 , P H 8.2) and incubated overnignt 
with 1:5o diluted high titered EBV specific serum .rom 
15 Mpr-patients. The serum had been preabsorbed to a bacteria - 
orotein extract (1 *l/1o 9 E. coli ceils, to reduce the _ 
oacterial protein generated background. Afterward, unoouna 
tog was removed by washing the nitrocellulose riiter =or 
"s'h in aelatine buffer (So mM Tris-HCl, 5 mM EDTA , iSo »M 
Had. o.2S % gelatine, 0.5 % Triton, o.2 % SDS, pn s .0) . 

n,*mr -he blotted SBV-soecific proteins raDDK 
For visualizing *-ne axot. — ^ — 

anti-human-IgG-antibodies coupled to peroxidase ana 
dieted 1 : 200 in TN buffer (154 tnM MaCl, lo mK Tris, P* 7..) 

. , . 2 w at RT, unbound rabbit antibodies were 

was aaded. Alter i n at ki , . . . 

rea oved by washing with gelatine buffer as 

above. Finally the peroxidase. .reaction was carriea out _n 

m . „ r1 oH 7 5 bv adding 5o rag aiamir.o- 
1oo ml 5o iuM Tris-.-iCl, ?H / . 3 / oy ^ 

- . i a n ^nd ^ ncubatinc 1o ulin- <a<- 
benzidine (Sigma) ana 4o nl 4 2 0 2 anc -ncao . ^ 

- . , • e ovnpr ^ me nt are shown in cig. ^. 
RT. The results or this experiment 

30 



20 



25 



35 
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(S g yeast extract, 1o g tryptone, 5 g NaCl) incubated 

1 ' 5 . h :^ t .. 3 _ 7 ° C : a . nd . finall y Plated on L-broth agar-plates 
(1.5 %) supplemented' with 5o ng/ml Ampicillin (Sigma) and 
4o ug/ml X-gal (3oehringer) . During this incubation 
bacteria carrying religated pCJC8 molecules yield blue 
colonies a nd those which carry recombinant plasmids yield 
white colonies. 



For identification of clones that carry the desired re- 
combinant plasmid, twelve white colonies were picked and 
grown overnight at 3 7"C in L-broth. Aliquots of DNA -pre- 
parations according to H.C. Birnboim and J. Doly ("A rapid 
alkaline extraction procedure for screening recombinant 
plasniid DNA" , Nucl. Acids Res. 7, p. 1513 (1979)) were 
digested by BamHI and HindHI and electrophoresed on an 
agarose gel as described before. Furthermore, for demon- 
strating the orientation of the integrated 'fragment , a 
digest was carried out with BamHI and Bglll. Finally 
the 3.3 kb was checked by a Xhol digest. 

Plasniid pUC635 carries the 3 . 0 kb Xhol-subf ragment of 

the BamA-fragment (pBR3 22 BamA) in the proper orientation 

and the proper reading frame relative to the lac CJVS 

Pr .° m0ter and is used for the expression of nearly the 

whole p 138 (Fig. 4) . The fusion protein encoded by pUC635 is composed of 12 

amino acids of the B-galactosidase amino" terminus , about 

1o2o amino acids of p138, 6o amino acids" of the carboxy ' 1 
terminal part of the B-galactosidase and another 29 amino " " 

acids of a pBR322 encoded region . Plasmid pOC61 30 carries the 
3.3 kb fragment in the opposite orientation (Fig. 4) Since 

the- strain E.coli K1 2 JM83 is not a Q-galactosidase- ; -- 

repressor overproducer , the fusion protein is constitutively 
expressed. Therefore, the. plasmid pCJC635 was introduced.- -, 
into, the S-glactosidase repressor overproducer strain. 
E.coli K12 BMH71-18 (DSM 3413) (U. Ruther, B. Muller-Hill, "Easy- 
identification of cDNA clones", EMBO Journal 1o, p. 1791 
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(1983)). Instead of strain E. coli K12 MBH71-18 strain 

OSM 3423) . , , x. 

E.coli K12 JM109/can also be used (without essential altera- 
tion of the experimental procedure) . 

Besides pUC635 three other plasmids were constructed : pUC924 , 
pMF924 and pKK378 (Figures 6 to 8) . 

The insert of pKK378 starts at the same Xhol-site and con- 
tinues up to the third Xhol-site located 250bp 3' of the stop 
codon. This fragment of 3.3kb was generated by an incomplete 
digest and inserted behind the tac-promotor and the start co- 
don of pKK240-11 (F.Amann et al., supra). The expression pro- 
duct contains only two bacterial amino acids and its size is 
smaller then the size of the expression product of pUC635 
because the bacterial lacZ part is missing. 

pUC924 contains the fragment from the Bgl II-site to the 
third Xho I site, pUC*9^wa 3 s 2 used as vector. Since the size of the 
insert is smaller than in pUC635 and since the stop codon from 
p138 is used, the molecular weight of the expression product 
is expected to be smaller than in pUC635 and pKK378. 

The plasmid pMF924 was constructed fran pEA305 (E. Amann et al., supra) 
and the same Bglll-Xhol fragment as in pUC924. oEA305 has a tac-promotor 
followed by the N-terminal part of the C1 repressor, the 
resulting fusion protein is expected to be 17kd larger than in 
pUC924. 

These constructs were tested for the production of EBV-related 
antigens by inducing the -tac- and lac-promotors with IPTG and 
separating the proteins on an SDS-PAGE. None or only weak 
new bands could be -detected on Coomassie-blue stained gels in 
the regions with the expected sizes. But after a transfer of 
the proteins onto nitrocellulose and immunostaining with a 
high titered NPC-pool serum and a peroxidase conjugated second 
anti-IgG antibody new EBV-specific bands were clearly detec- 
table in all constructs. (Fig. 4) 
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Example -2 - - - . . 

Cloning of the p138 encoding region 

According to the sequence data of R. Baer et al., (supra) , 
there is a large open reading frame contained in the 
3amA-f ragment of EBV 395-3 which is suitable for encoding 
p 1 38 .- The nucleotide sequence,, the corresponding aminoacid sequence 
and the respective regulatory . elements of the gene of 
pi 38 are given in Fig. 3. 

50 tig DNA of the plasmid pBR322-BamA (J.Skare et al.,. supra) were .di- 
gested with 50 U Xhol (Boehringer) fcr 2 h .at 37°C in a total volume 
of 15o ul containing 15o mM NaCl , 1o mM MgCl 2/ 6 mM mer- 
captoethanol, . 6. mM Tris-HCl, pH 7.9. 3o ul stop buffer 
(To mM Tris-HCl, 5o mM EDTA,"6o % sucrose, 1 % bromphenol- 
blue, pH 7.5) were added, the mixture was put onto a 
preparative 1 % agarose gel. in acetate-buffer (o.o4 M Tris- 
acetate, 2 mMJEDIA-,- pH 7.6), and electrophoresed for. 16 h 
-with '4b V at 4°C. As a size marker Hindlll digested A.-phage 
DNA (Boehringer) was used. After staining the gel in Tris- 
acetate buffer .with ethidium bromide (0.5 \xq/ml) for 
1 h-at room temperature (RT) , the DNA was visualized by UV- 
illumination and the bands corresponding to 3.o and. 3.3 Jcb 
were excised* (the 3.o kh Xhol-generated fragment is the 
desired fragment, .the 3 .3.- Jcb. Xhol-generated fragment is 
a partial digest product (one Xhol restriction, site was 

not CUt) )•;--■ . i :.. " . 

The. DNA. of .the-, bands was - eluted by putting the agarose 
pieces? intovdialys is™ bags 7 -adding 3. volumes.rof ..Tris-acetate 
burfer and electrophoresed-' for-4' h M00V, 4°C) . Further 
purification- was carried, out- by a chromatography with 
Elutip:- D" columns- (Schleicher- & Schuell) - according to the 
procedure recommended by the. manufacturer ,' extraction of 
the contained ethidium bromide with isoamylalcohol and pre- 
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cipitation of the DNA by adding 2.5 volumes ethanol and 
incubating overnight at -2o°C. The DNA was collected bv 
centrif ugation in a Sorvall 53 34 rotor (17.ooo rpm, 2o 
min) and washed with 7o % ethanol. After lyophilization 
the DNA was dissolved in 1 5 ul TE buffer (1o mM Tris-KCi, 

i mM ZDTA , pH 7.5). 

The DNA concentration of the two isolated fragments was 
estimated by eiectrophoresing 1 ul each in parallel with 
1oo ng and 1 ng of pUCS DNA. 

Sail digested DNA of the vector pUC8 (deposited with the Deutsche 

Saitmlung fur Mikroorganism*n (DSM) , Gottingen , West Germany, 

under the accession number DSM 3420 ) (J. Messing, J. Vieira, 

of double-digest restriction fragments", Gene 19, p. 269 

(1982)) was prepared as described before, excepr that for 
inhibition of religation of the vector during the following 
ligase reaction the DNA was treated with alkaline phosphat- 
ase (o.5 units (Boehringer) , 3o min at 37°C) . 

In the following, the two purified fragments were each 
inserted into the cleaved vector (Sail and Xhol oroduce 
the same cohesive ends, i.e. -TGCA-) . For this ouroosa 
for each of the fragments a ligation reaction was carried 
out with 3oo ng fragment DNA and 1oo ng pUC3 DNA in a total 
volume of 2o ul ligase buffer (1o mM Tr is , 1o mM MgCl 7 , 
5 mM mercaptoethanol , o.S mM AT? , ptf 7.5) containing 
I U T4-DNA ligase (Boehringer). After 2o h at 14°C, 30 ui 
TE buffer and 200 ul competent E.coli JM83 cells (ATCC 35607) (J. Vieira, 

J. Messing, "The pCJC piasmids, an M1 3mp7-der ived system 
for insertion mutagenesis and sequencing with synthetic 
universal primers", Gene 19, p. 259 (1982)) were added. 
The transformation was done according to the calcium 

chloride procedure (M. Mandel, A. Higa, "Calcium dependent 
bacteriophage DNA infection", J. Mol. 3iol. 53, p. 154 
(197o))- Then the cells were mixed with 1.5 ml t-broth 



•': .0.1.73254 

- . 

••■ — ... 

- 53 - 

Based on the assumption that antigenic sites are mainly 
located, in. hydrophilic B-turns. which, are located on the 
surface of the protein, the region between about amino 
acid 52o and the carboxy-terminus of p138 should* be "anti- 
genic. The corresponding DNA sequence is- represented bv 
a Pstl-fragment of paC63 5. 

Thus pUCS35 was cleaved with PstI and- all Pstl-f ragments 

were isolated and introduced into PstI -cleaved pUC8 , 
the remaining vector fragment with additional 400 bp (up 

to the first Pstl-site of the pi 38 "coding sequence) was 

religated (all methods as described in example 2). 

The resultina recombinant olasmids were desiemated oDC 
P400 , puC P380, pUC P600, pQC P210,"pUC P750, and' 
pQC ?540 ,' respectively. 

The aminoterminal region of the p138 enco_ding sequence was 
cloned, by digest incjthe- plasmid p3R3 22-BamA with PstI and 
HgiAI and- inserting said fragment .into PstI cleaved pUC9- 
(J. Hessing et al., supra) (methods as. described, in ex- 
ample 2) . The resulting recombinant plasmid is designated 
oUC HP. ' " - "~ " 

With the exception of pCJC" HP -in which translation stops 

3lz the 3* end of the insertion, in all subclones orientation 

and reading frames relative to pUCS are correct. • 

Finally the recombinant plasmids were 'introduced "into : - - 

4 

2. coli K12 JM "109- " cells. * " 
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1 Exanple 6 



5 



25 



30 



Expression of" antigenic determinants identified by computer 
analysis using pUR288 pi 38 subclones 



Since pUC subclones (example 5) sontimes are not stablv 
expressible in bacteria, because they cannot build up 
a suitable tertiary structure due to their shortness 
and therefore can be degraded by proteases to a larger 

10 extent than carrplete proteins we constructed recctnbinant plasmids encoding 
large fusion proteins using at least a part of the B-galactosidase 
encoded by pUR288 (DSM 3415) (U. Rather et al., supra) by cleaving 
said Pstl-fragment subclones with BaraHT and Hindlll, isolating the 
respective* fragments and ligating than into BamHI and Hindlll 

15 cleaved pUR288 (all methods as described in example 2) . 

The expression was carried out in E.coli K12 JM109. 
The products were analyzed as described in example 3 . 
After coomassie-blue staining of the gel several large 
20 fusion " proteins of different size were detected , however, 
after preparation of a Western -blot , only the produces 
expressed by pUR6oo and pCJR54o showed specific reaction 
with the IgG antibodies mentioned (Fig. 10) . 



These results are in good agreement with the computer 
analvsis . 



Additionally the expression of the clones obtained 
according to example 5 was carried out according to 
example 3. The products, too, were analyzed as described 
in example 3. From the coomassie-blue stained gel it 
can be taker*. that only plasmids pUCP600 and pUCP380 code for a 
stable fusion protein. The Western-blot shows that only 
pUCP600 derived fusion protein is antigenic (Fig. 11). 
35 This fusion protein contains 11 amino acids encoded 



0173254 



- 55 - 

1 by the aminoterminal cloning site,' a region encoded by * 
about600 bp of p 138 and" carboxy terminal* amino acids of 
the "lac 2 gene. "Thus, the- recombinant expression plasmids 
pUR600 and pUR540 as well as pUCP600 can be used for 

5 the- - product ion of large and small fusion proteins, 
respectively, containing an antigenic determinant 
of EBV-protein p138. 

Example 7 

10 

Application of the protein encoded - by. plasmid. pQCP6Q0 for th e 
stabilization of per se unstable parts of eukaryotic 

15 ... . ; 

By means of the experiments of example 6 it was shown 
that the p138-derived protein parts (regions) are 
unstable with the exception ' of the protein encoded by plasmid 
pUCPSOO. the second antigenic region from the C-terminus of , 
20 _ p.138- (p540', see Fig. 9) is not stably expressible using the 
recombinant pUC-vector pOCP540. 

The- ability of the P600-region of p138 to stabilize 
such a per se unstable expression product is shown in 
this example. 

25 

For this purpose it was necessary to remove the 
5 1 -PstX-site ofpHS^OO by digesting the plasmid with SstI 
and- Hindll I (the SstI site is located about 20 bp 3 1 from 
the first PstI site). The p138-related Sstl-Hindlll frag- 
30 ment was iaserted into -Sstl/Hindlll cleaved pUC12 (DS*3422) 
J. Messing, "New M13 vectors for cloning", in Methods of 
Enzymology Vol.. -101,- Part. C. ,. R. Wu, L . GroBmann and 
K. Moldoave (eds.), Acad. Press, New York, 1983, 20-78). 



35 
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Then the resulting 

recombinant plasmid was digested with EcoRI. and Pstl. 

The obtained- 600 bp fragment was inserted into plasmid pUCS. 

The 5' -Pstl site was now replaced by an SstI site 

and thus the reading frame is reconstituted, at . the 

3'- and the 5 '-end of the insert (Fig. 12a). The resulting 

recombinant plasmid pUC601 still expresses a stable 

product (Fig. 13). 

Between the Pstl and HindHI site at the 3 '-terminus of 
the EBV-encoded sequence a synthetic oligonucleotide 
obtained according to known methods coding in fraiae for 
5 arginine and 2 stop codons was inserted as shown in 
Fig. 12b). The resulting plasmid pCJCARG6 01 encodes, the 
P600 region of p138 fused at its C-tenninus to 5 ar- 
ginine residues. 

In a last step the Pstl fragment encoding the P540 region of 
p138 was ligated to the Pstl fragment encoding the 
P600 region of pi 38 after digestion with Pstl. The 
resulting recombinant plasmid pUCARG1140 encodes a 
stable protein of about 43kd which contains two 
antigenic sites of pi 38 fused in frame. In this fusion 
protein the protein region P600 stabilizes the protein 
region P540 (Fig. 13). The arginine residues at the 
cartoaxyterminus of the expression product may be used for 
the purification of the resulting fusion protein as 
described by Sassenfeld and Brewer, (supra) (Fig..- 16). 

. . . Example 8 , - •> :v 

Construction of the recombinant plasmid pUCARG680 

From the plasmid pUCARG1140 a modified version was con- 
structed which lacks 435bp of the pi 38 encoding region, 
the Oterminal part of the p600 fragment and the N-ter- 
minal part of the p540 fragment. The main antigenic 



sites predicted by the computer program are still present 
The-plasmid was" designated- as pUCARG680 and its con- 
struction was achieved by digesting pUCARG1140 with Ncol 
(cleavage site coresp onds to bpl841 and bp3243 in 
Fig. "3). Since the' reading T frames in : the p600 Ncol site 
and the p540 Ncol site do not fit, the sticky ends were 
removed with S1 -Nuclease. 

30 ug of pUCARG1140 were digested with Ncol, the 3.3kb 
vector-p138 fragment was separated by gelelecrophoresis 
and purified. 5ug of this DNA fragment were digested 
with 100 units S1-Nuclease for . 15 min at roomtemperature 
in 100 ul containing 33mM Na-acetate f 50 mM NaCI, 
0.03mM ZnS0 4 , pH 4.5. The digest was stopped by phenol 
extraction. After precipitation with ethanol the DNA 
was religated with T4-DNA ligase and used to transform 
competent' : E. col i* K1 2 JM109 1 cells. The resulting clones 
were screended .for the. appearance of a new protein with 
30kb in size (pUCARG680) . The shortened p600/p540 fusion 
protein encoded^by pUCARG680 still reacts as an antigen. 

The newly constructed recombinant plasmid pUCARG680 was 
deposited with the DSM under the deposition number DSM3408 

Example 9 

■ - ~ 

• * 

Assay of the antigenicity of the fusion protein encoded 
by plasmid pUCARG1140 

Immunoblots with the fusion proteins encoded by. the 
recombinant plasmids p(JCARG1140, pUR540, and p(JR600 
(examples -6 and. 7) using.. individual NPC-sera reveal 
-that the immunological reactions differ in..... . various 

patients (Fig. 14). . In this context it has to be under- 
stood : .that said plasmids encode fusion proteins contai- 
ning the p138 regions P540 + P600, P540, and P600, 
respectively (see Fig. 9). 
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Whereas in NPC serum no .352 the main fraction of the 
IgG and IgA antibodies is directed to the P540 region, 
the main fraction in NPC serum no .354 is directed to the 
P600 region of p138. A representative pool prepared fron 
many sera from NPC patients did not detect additional 
antigenic sites. The conclusion from this finding is that 
the antigenic determinants P540 and P600 as encoded by 
the recombinant plasmids of the present 

invention are necessary and sufficient to achieve the 
desired specificity for ELISA tests useful for diagnostic 
purposes . 

Example 10 

A pplication of plasmid; pUCARG1140 encoded fusion protein 
for the detection of NPC in ELISA tests 

The purified fusion protein encoded by pUCARG1140 was 
coated on micro-titer plates. Ten individual NPC-sera 
were tested for their IgG and especially for their 
IgA reactivity. The IgA-anti-EA titer of these sera was 
previously determined in conventional immunofluorescence 
tests. The highest titer found was 1:80. In the ELISA 
test shown in Fig. 15, two EBV-negative , one NPC-serum 

pool and ten individual- NPC-sera_ were . tested... up. to a 
dilution 1:10640. The test was_ performed according to 
the usual ELISA protocol. Bound antibodies were detected 
with peroxidase conjugated mouse anti human IgG, -i.e. 
IgA and peroxidase reaction.- All NPC sera "show a reaction 
with the coated antigen (up to 1:2560 in IgA) and no 
background reaction could be observed in the negative 
controls. This result indicates that the pUCAKG1140 
encoded expression product is suitable for- the diagnosis 
and early detection of NPC. 
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1 Example 1T 

Cloning of a subregion of the gene coding for gp 350 in 
the vector pUC8 

The coding region of gp 250 and gp 350 was mapped to the 
Bam L-fragment (J. Skare et al. , supra) of the EBV B95-8 genare. 
As both polypeptides share identical regions it was supposed that 
both proteins are encoded by overlapping reading frames 
(M. Kuramei, D. Thoriey-Lawson , E. Kief 5, "Epstein-Barr 

10 virus DNA fragment encodes messages for the two major 
envelope glycoproteins (gp 350/300 and gp 220/200)" , J. 
of Virol. 49, p. 413 (1984)). The sequence data of 
Baer et al. (supra) revealed a large open reading frame in- 
cluding a donor splice site and an acceptor splice site in said 

15 Bant L-fragment of the virus genome (Fig. 17 3^ -jg) # 

It is assumed that gp 3 50 is the translation product 
of the unspliced mRNA transcribed from this region and 
20 gp-250 is a product of the corresponding spliced mRNA 
(Fig. 17-). Since both products are found in the viral 
capsids it is assumed that a differential splicing of 
-said mRNA.. in a manner, comparable with the immunoglobulin 

.^^^te _ ... ^^^^^^^^^^ 

heavy, chain genes (T . Hon jo , ■ "Immunoglobulin genes", 
25 Ann.. Rev. of Immunol. .1 , p-. 499 (1983)) takes place. 

During -this splicing 630 bp of. the mRNA coding for gp .350 
are removed to yield the gp 250 coding mRNA. (Fig*.. 17 and 
27 (dotted lines)) (R. Baer et al. , supra). 

* » — . 

30 Therefore, the whole or a part of the reading frame of 
gp 350 was cloned for finally isolating and producing 
a av 350 related oroduct. It should be kept in mind, that 
not only, gp 250 but also gp. 350 are highly glycosylated 
proteins. In contrast, the proteins produced by expression 

35 

of the recombinant DNA molecules according to the present 
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invention, differ from the respective viral proteins 
normally occuring in nature. If expression is carried 
out in prokaryotes unmodified^ proteins are obtained 
whereas expression in eukaryotes gives proteins with 
different patterns of glycosylation or else modifications 
as compared to the natural product. 

The Bam L-f ragment was introduced in pBR322, and E. coli 
K 12 HB 101 was transformed with the recombinant plasmid 
obtained. <j. Skare, et al., supra) 

Instead of the host E,coli K12 HB101 the host bacteria 
used in the present invention can also be used. 

The contents of the publications of M. Hummel et al. (supra) 
J. R. North et al. (J. R. North , A. J\ Morgan, J. L . 
Thompson, M. A. Epstein, "Purified Epstein-Barr virus 
Mr- 340 .000 glycoprotein induces potent virus-neutralizing 
antibodies when incorporated in liposomes" , Proc. Natl. 
Acad. Sci^-GSA 79, p. 7504 (1982)) and D. A. Thorley- 
Lawson and C, A. Pcodry ("Identification, and Isolation 
of the Main Component (gp3S0-gp220) of Epstein-3arr Virus 
Responsible for Generating. Neutralizing Antibodies In Vivo", 
J. Virol.' '43, p. 730 (1984) do not permit predictions that 
subregions- of the gp 250/350 encoding sequence are coding 
for sufficiently antigenic and/or immunogenic proteins 
and" 'that thesa products '-after- selective introduction of 
these subregions' can be' stably expressed in prokaryotic 
and eukaryotic ceils. It is therefore surprising that 
coraoletelv unmodified or in a different way modified 
cp 250/350" related proteins of the present- invention 
are sufficiently" "active .antigens and/or immunogens . 
In particular in previous publications it was not ex- 
cluded that minor carbohydrate residues of the- protein 
contribute "significantly to the antigenic or immunogenic 
potential of this protein.- ' " 
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1 As shown in rig. 17, a 1 . 9 kb ■ Pstl-Pstl-fragment "of the 

3ara L-fragment {HummeL et al., supra) contains the part 

of the gp 3 50 coding region beginning at aminoacid 
position 232 and ending at aminoacid position 825. 

5 

A iarae scale preoaration of the pBR 322-BamL plasmid DNA 
was dene according to the method published by H.C. Bim- 
bo in and J . Doiv ("A raDid alkaline extraction procedure 
for scraen.inc recombinant oiasmid DNA", Nucl. Acids Res. 7, 

10 

p. 1 5 13 ( 1979)). 50ug of this DNA were digested for 2 
hours at 37°C with 100 units PstI (Boehringer) in.: 50 
mM NaCi, 10 mM MgCi-, 1 mM DTT , 10 mM Tris-HCl, pH 7.5. 
The digestion was stopped by addition of 1/5 vol. 50 mM 
EOT A , 50 % sucrose, 2 % ' bromphenolblue . -The-^resulting 

15 

solution was eiectrophoretically separated on a- 1 % 
acarose aei (Seakem, FMC) in Tris-acetate buffer (0.04 M 
Tr is-acetate , 2 mM EDTA, pH 7.6). As a size marker HincIII 
digested A--?hage DNA . (Boehringer ) was used. After the 

» 1 ^czroohor'esis at 40 V for 14 hours at roour temoerature. 
20 ' . " 

(RT) , the gel was stained in Tris-acetate buffer containing 
0.5 ug/ml ethidium bromide. 



The DNA bands in the gel were visualized by UV-illumina- 
25 tion and the 1.9 kb Pstl-PstI fragment was isolated as 

described in example 2. 

PstI digested DNA of the vector p(JC8 was prepared as 
30 described' before , except -that for inhibition of., reiigation 
of the vector during, the- following, ligas.e reaction, the 
DNA was treated, with... alkaline, phosphatase. (0.5. units 
(Boehringer ), 30_ min- at. 37?C) . - . . : 

" - • ' * » - •* « m i^te 

35 The concentration of the purified fragments was . estimated 

by eiectrophoresing 1 nl each in parallel with .100 ng and 
500 ng of pUC8-D-NA (under conditions described above) . 



J 
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400 fig of the 1.9 kb Pstl-Pstl-f ragment and 100 ng of the 
PstI digested vector DNA were ligated, E.coli K1 2 JM109 
was transformed with the ligated plasmid DNA and positive 
clones were identified as described in example 2. 

The obtained clone was designated £ . coli K1 2 JM109 
pUCLP1.9 and the resulting recombinant plasmid pUCLP1.9, 
respectively . 

Example 12 

Cloning of a subregion of the gene coding for cp 3 50 in 
the vector pCJR290 

?or the expression of- a stable product of the op 350 
subregion said 1.9 kb Pstl-?stl-f ragment was reclon 
in the vector pUR290 (DSMT3417) (Fig. -20) (U.Ruther et al. , infra) 
The resulting recombinant plasmid is coding for a fusion 
protein of an aminoterminal region of the S-galactosidasa , 
followed by the aminoacids 232 to 825 of gp 350 and 
aminoacids coded by the cloning-site of p(JR290 and pBR3 22 
nucleotide residues. The respective aminoacid se- 
quence is given in figure 21 . 

50 ^g DNA of the plasmid pUCLP 1.9 were digested with 
100 units- BamHI and Hindlll and separated on a 1 % acarose 
gel as described above. The resulting 1.9 !<b 3amHI/HindIZI 
fragment that contains only a few more nucleotides than 
the Pstl-?stl-f ragment originally introduced into pUCS 
was separated from the other resulting fragments on a 1 % 
agarose gel (as described above) . Finally it was isolaced 
from the gel as described above and iigated into 3amHI/ 
EJindlll digested DNA of the vector ?UR290 ( Q . Riither, 
B. Miiller-Hill "Easy identif ication of cDNA clones", 
ZMBO Journal 1o, p. 1791 (1983)) according to the methods 
described above. 
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The next step was the transformation of the B-galactosi- 

dase repress or -protein overproducer. strain - ... 

E. coli K12-JM109 with ~~ these recombinant 

DMA molecules. The transformants were plated and analysed 
as described above , except that the aliquots of the DNA 
preparations were digested with BamHI/HindHI and EcoRI. 
The resulting clone, E.coli-K12 JM109 . pORLPi.9 carries 
the plasmid p(JRL?1.9, that is a recombinant of said 
3aztiHl-HindIII 1.9 !<b fragment of the plasmid pOCLP1.9 and 
the vector p(JR290 (see Fig. 20). 

Example 13 

en 3S0 related oolv-nebtides svrithesized bv E. coli Kl 2 

^ . 

JM109 PURLP1.9 

p • r 

In an overnight culture E. coli K12* JM109 pURLP1.9 
was grown at 37 °C in Seal L-rbroth supplemented, with 50 \xq/ 
.tlL Ampicillin*. The culture was then diluted to an optical 
density at 560 nnL(OD 56Q )of 0.4, and A ml of this bacteria 
suspension were incubated at 37°C until an 0D^^ rt of 0.8. 

The expression of- the genetic information carried by" 
plasmid. pURLPI .9 was then induced* as described in 
example 3 and finally the proteins were visualized by 
coomassie-blue staining as described in example 3. 

Xn comparison with the control experiment f several new 
proteins, e nc od ed by the plasmid pCJRLPI . 9 " and ranging in 
size from 116 kD to 200 JcD , were detected" (Fig. 22). The 
different size of the expression products" may be due to 
incomplete mRNA synthesis- oir translation.-' To- prove ' that 
the "new proteins- are' EBV-related- products*, all-' the " 
~ "eiectrophoretically" separated proteins" were" transferred 
to a nitrocellulose filter, i.e. a "Western-blot" was 

prepared according to the method described in example 3. 

The results of this experiment are shown in Fig. 22. 
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Example 1 4 

Purification of the S-cral • : cro 350 fusion protein encoded 
by the pla'smid oCJRLPI.9 

The replacement of the natural carboxyterminal amino 
acid sequence of the 3-galactosidase by a gp 350 related 
amino acid sequence prevents the formation of 3-caiaczosi- 
dase tetramers. Furthermore the newly expressed fusion 
protein is present in a high concentration in the bacterial 
ceil. Therefore the fusion protein precipitates in the 
cytoplasm of the host ceil. 

■r • 

according to the method described in example 4 the clone 
E. coli K12 JM109 pURLP 1 . 9 _. was- used for the production 
of the corresponding fusion protein. 

The results of the several stages of this purification 
procedure are shown in FigT 23. 

Example 1 5 

Expression of selected' antigenic epitopes of gp250/35Q 
as fi-galactosidase fusionproteins 

Fig. 2 4 shows the computer-predicted secondary structure 

of gp350 together with the relative values of hydrophilic 

. (dark.. circles) and hydrophobic (grey circles) areas,. B-tums 

or. loop structures are indicated as line turns of 180° 

- (oc-helices , . fi-sheet and coil structures are barely dis- 

cernable in the scale used) . Based on the assumption 

that. antigenic sites are mainly located in fl-turns in 

an hydrophilic. environment , which may be exposes to the 

surface.- o£. the. protein,- the. regions at about aminoacid 50 

and aminoacid 740 and 800-830, respectively, are expected to represent 
antigenic epitopes. 
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1 Subcloninq and expression of the N-terminus of gp250/350 

The EBV BamHI-L fragment which was cloned in pBR322 (see J. 
gkare et al. , supra) was digested with EcoRI (restriction sites at 
5 positions 650 and 1284 in the sequence given in Fig- 17), 

the resulting 634 bp fragment was elated fran an agarose gel 

after electrophoresis and ligated to EcoRI linearised pOC1 9 

et al. 

(DSM3425) (yanisch-PerronV Gene 33, 103-119 (1985)). Then, 
E-coli K12 JM109 was transformed. with the ligation pro- 
ducts (all steps were carried out as describedin Exanple 2). 
According to example 2 the recombinant plasmids obtained 
<were tested- for the orientation 6f their insert- using suitable re- 
striction enzymes. A reccrabinant plasmid carrying the insert in the 
15 -opposite orientation of the reading frame relative *tb the reading 

frame of the lacZ gene of the pUC19 plasmid was designa- 
ted as pUC19LEP600 and used for further cloning: 



G.1732-! 



20 



25 



35 




pQC19LEP600 was digested with BamHI and PstI (the BamHI 
30 site is derived from pUC19, the PstI site corresponds 
to position 1248. in Fig. 17), the resulting 600bp 
fragment was inserted .into- pUR291 (DSM3418 ) (Riither, supra) , pre- 
viously digested with BamHI and PstI . The resulting • 

*«. „ * .... 

recombinant plasmid pURLEP600 displayed the following 
sequence in its linker region at the C-terminus of 
the 6-galactosidase: 



J 
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PUR291 / PUC19 / gp250/350 
B-gal-TET CGG GGA TCC CCG GTA CCG GAG CTC GAA TTC CCA TTT 

/ ?(JR291 • 

TGC AGC CAA GCT TAT CGA TGA 



The expression of the fusion protein from this recom- 
binant plasmid after IPTG- induction was carried out 
as described in example 3. The result of this experiment 
is shown in Fig. 25. From Fig. 25 (lower part) it can 
be taken that the expression product obtained is 
recognized as a moderately antigenic protein by a pool 
of NPC-sera. 

Subcloning and expression of the C-terminus of gp250/35Q 

The region covering the antigenic epitopes near the 
C-terminus which, according to the computer-directed 
analysis, also is expected to be antigenic, was isolated 

by digesting the plasmid pUCLP1.9 (see example 11) with 
XmnI (restriction site at position 2760 in Fig. 17) and 
Hindlll (restriction site in the region derived from the 
pUC-plasmid) . The purified 386 bp fragment was inserted 
into pUC19 previously digested with Hindi and Hindlll. 
The resulting plasmid which was introduced into E.coli 
K12 JM109 is pCJC1 9LXP390 : - , 
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The insert of pOC19LXP390 was. cut out with BamHI and 
Hindlll and ligated into pUR288 digested. with the same 
enzymes. The resulting recombinant plasmid was intro- 
duced into E.coli K12 JM1 09 . and was designated as 
pURLXP390. The sequence in its .'linker, region is as. 

follows: 



10 



PUR288 / PUC19 / : . 9P350 

G-gal-TGT CG6 G6A TCC TCT AGA GTC AGT TCC CAC— 

pUR288 - 
TTA TCG 



/ pUC8 / 
■GTA CTG CAG CCA AGC 



15 



20 



25 



30 



35 



After IPTG- induction a 6-galacto'sidase fusion protein 
was synthesized by said transformed host. In" a "Western 
blot the expression product shows a high reactivity with 
the NPC sera pool (see Fig. 25,_lower- part). 

Exam ple 1 6 

-~ — — - 

* 

use of the a-aal;:gp250/3S0 fusio n proteins encoded 
bv the newly constructed recombinant plasmids in dia- 
gnostic tests 

jilg et al. (W.Jilg and H. Wolf, "Diagnostic Significance of 
Antibodies to "the : Epstein-Barr Virus-Specific Membrane Antigen 
gp250», The Journal of Infectious Diseases-,-152,-2-22-225 ( 1 985) ) 

have shown the validity of gp250 . * ^ . 

and-g P 350 as antigens for the determination of the a-ina. 
status to EBV and especially for the diagnosis of chrome 
EBV- infect ions: Persons showing a normal immune response 
-after an EBV-inf ection possess antibodies against gp250 
'=-and gp3507=wherea S -patients^uffering- from- chronic 

EBV- infection show an immune. response- only to gp350 which 
still contains the additional, intron sequence (see Fig. 
27) The serological status of these persons can be 
checked in ELISA tests using the three fusion proteins, 

J 
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purified according to the method given in example 4. An 
antibody reaction to all three fusion proteins indicates 
a normal immune status. -I'f there- is nor or weaker reaction with the 
proteins encoded by pURLEPSOO ana ?URLXP390, but reacti- 
vity against pURLP1.9 (which contains the intron se- 
quences, see Fig. 27) a chronic EBV-inf ection is very 
likely. 

IgA antibodies to the membrane protein gp250/350 and 
to subfragments thereof are absent in the normal popula- 
tion, but present in 58 % of Nasopharyngeal Carcinoma 
patients when measured in a relatively insensitive 
immunofluorescence assay. These results are similar to 
the detection rate of IgA antibodies to EBV specific 
early antigens in comparable testsystems. In analogy 
the more sensitive ELISA test brings the detection rate 
close to 100 % with only minimal increase of false posi- 
tive results. Therefore the antigens encoded by the 
newly constructed recombinant plasmids p(JRLEP600, 
p(JRLXP390, and pURLPI . 9, respectively , are valuable 
substances for the initial diagnosis and the control 
of a therapy of Nasopharyngeal Carcinoma. 

Example 1 1 
- 

Expression of the N-terminal gp250/350 fragment in the 
plasmid pUC8 

The recombinant plasmid covering the N-terminal region 
of gp250/350, pUC19LEP600 (see example 15), was digested 
with BamHI and Pstl. The EBV derived fragment was. iso- 
lated and ligated into pUC8 , previously digested with the 
same enzymes. The sequence in. the.. ; linker, region of the 
resulting clone, pUCLEP600, is the following: 



- 69 - 
/ pUC19 . 

/ 

ATT. -ACS AAT TCC CGG GGA TCC CCG 

pUC8 

ITT— — -ACC TGC : AGC CM GCT TAT 



• 0173254 

• * • * i • - 

> / gp350 

GGT ACC: GAG CTC GAA TTC CCA • 



10 After induction with IPTG^the fusion protein encoded 

by p(JCLEP600 is quite stable in the bacterial cells and 
is recognized as an antigen by the NPC sera pooL (see 
Fig. 26). The: bacterial fusion^ part consists of. 14 amino- 
acids at. the N-terminus and 9 at. the C-terminus. The 

15 value of. this protein is its applicability in a vaccine, 
especially when it is fused with the per se instable 
second antigenic region from the C-terminus as it was 
determined with the 8-gal fusion proteins (see example 15) 

20 .. The- inserts of the recombinant expression plasmids and 
cloning plasmids constructed according to examples 11 
and 15 to 17 are summarized in Fig. 27. 



25 



Example 1 8 



Expression of the N-terminal part of gp250/350 as a 
- p1 38: :gp250/350 fusion protein . . . 

: The plasmid. pUC19LEP600 (see example 15) was digested 
30 with' PstI and. the . : resulting, 60 Obp fragment was ligated. 
-M.to. the* PstXj linearised, plasmid. pUCARG601 (see example 7) . 
. The gp350- insert was, checked to be in the same orien- 
- tation -as _the pUCARG6 01. -reading frame and the resulting 

recombinant.. plasmid was. designated as pCJCARG1230 . The 

. sequence in the linker region and at the junction sites 
of the obtained plasmids is the following: 



35 
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PUC8 •". / P UC12 / P 138 / pUC19 

AT6 ACC ATG ATT ACS AAT TCS AGC TCT CT6 ACC ATC CTG CA6 STC GAC 

TCT AGA 

/ 9P350 / pUCARGoOl 

GGA TCC CCG GGT ACC GAG CTC GAA TTC CCA TTT ACC TGC AGC GTC GTC 

GTC GTC GTT GAT AAC GTT 

After induction with IPTG, E.coli K12 JM109 carrying 
pUCARG1230 expresses a stable and antigenic protein 
which consists of antigenic regions from* two different 
15 proteins, namely p138 and gp250/350 i (see Fig.- 26). 



Furthermore it can be used as antigen in ELISA tests and 
20 also for vaccination. 

Example- 1 9 - _ " J_ . . * 

Neutralisation test with sera- derived from rabbits 
25 * immunized with gp250/350 antigens 



Supernatants from B95-8 cells- were- used- to- immortalize 

human ""M 1 ^ 1 cord blood cells (Lymphocyte fraction from 

6 

Ficol/Hypaque gradient). 0,5 x 10 lymphocytes were 
30 seeded per 0,5 ml microtiter plate- well and 5 0v til of a 
cell-free supernatant of B95-8 cells- were**" added- and 
allowed to adsorb for 2 hours at 37 °C. After incubation 
the virus-containing medium was removed,* cells were 
washed "with RPMI1 640 F medium containing 1 0 : -% -"fetal calf 
35 serum and incubated in 200 ul of the same^ medium- at 37 °C 
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in a 5 % CO^ atmosphere. Developing colonies of 
lymphoblastoid cells were evaluated not sooner than 
three weeks after the start of' the experiment and 
counted as positive transformation. 



10 



15 



The neutralizing properties of sera were tested by pre- 
incubating for 1 hour under slight agitation aliquotes 
of the Epstein-Barr Virus containing B95-8 cell super- 
natant with 20 ^1 of test serum including the respective, 
preimmunization serum as control in a replicate test before 
the supernatant was allowed to adsorb to the umbilical 
cord blood lymphocytes. After removing the inoculum from 
the cells after 2 hours the maintenance medium (RPMI1640 
supplemented with 10 % FCS) was supplemented with 5 % 
of the respective sera under test for neutralizing 
activity. The following results were obtained: 



20 



PBS 

(control) 
(no ^cleus) 



Virus 



EBV nega 
.„ tive 
. human 
rum 

25 no colo- colonies 
nies 



Virus 



Virus 



EBV posl 
tive 
hunan 
run 

no 
nies 



rabbit 



colonies 



Virus 



rabbit 
1 



Virus 



rabbit 
income 
.serum 2 



no colo- no colonies 
nies 
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1 Example 20 

Cloning a nd expression of antigenic fragments of the virus- 
capsid protein pi 50 

The coding sequence of the diagnostically relevant protein 
5 p 150 (Virus capsid antigen VCA (see Example 1 , Fig. 28D) 
was examined for antigenic sites and. subcloned for the ex- 
pression as B-galactosidase fusion proteins. The N-terminal 
region which is expected to encode an antigenic site was 
obtained by digestion of the Charon 4A phage EB 69-79 
*° (G.tt. Buell, D. Reisman, C. Kintner, G. Crouse, and B. Sugden, 
"Cloning overlapping DNA fragments from the B95-8 strain of 
Epstein-Barr virus (ATCC CRL. 1612) reveals a site of homology 
to the internal repetition", Journal of Virology 40, 977- 
982 (1981)) with BamHI. and a resulting 1 1 76bp- fragment was 
15 cloned into the BamHI site, of pQC12. From. a. resulting plasmid 
with the insertion, in the proper- orientation a 580bp fragment 
was excised with Xhol/Sall. The Sail site derives from the 
PCJC12 linker, the Xhol site is located 33bp upstream from 
the start of p150. This fragment was inserted into pUC8 di- 
20 gested with Sail (Sail and Xhol share the same sticky end 
sequence) . The resulting clones were screened to have the 
p150 start codon next to the BamHI site. From a. proper clone 
- the-p150 encoding region "was "cut"" out with BamHr- and Hindlll 
and cloned into pCJR290 digested with BamHI and Hindi 1 1 
25 (pUR290CXH580) . The expression of the 3-Gal::p150 fusion 
protein from this clone is shown in Figure 29. Its ability 
to react very well with a MPC serum pool can be taken from 
Figure 30. 

30 Further, p150;:8-gal fusion contructs were obtained according- 
ly. For example the subclones 

PUR290DBX320, p(JR292DBB1 80 , pUR290DTT 700, pCJRDTT740, 
PCJR290DTP680, pCJR288DPP320 which are indicated in Figure 31. 
From the designation of the subclones, the vector used can be 
taken, e.g. for the construction of subclone pUR290DBX320 
the vector pUR2~90 was used. 
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From Figure 30 the restriction enzyme sites used for subclo- 
ning can also be taken. All clones with the exception of 
pURDBB180 were constructed, by subcloning the desired 
fragments into p UC8 or pUC12 (see supra) to obtain 
BamHI and Hindlll sites-, suitable for the cloning to 
pUR vectors (see mpra) . pUR29 2DBB180 was derived by 
insertion of the I80bp Bgl'll-Bgill fragment (see Fig. 31) into 
pUR292 linearized with BamHI. Figures 29 and 30 
show their expression and antigenicity. 
The B-gal::p150 fusion protein encoded by pUR290CXH580 and 
purified according to example 4 reacts in the ELISA, test. 
as an EBV specific antigen indicating its applicability;;'- 
in diagnosis. Stable expression . ; i • 

was also obtained with the N-terminal fragment of p150L by-;' 

inserting the 580bp fragment (used for the construction- 

(DSM 3424) *...-. 
of pUR290CSH530) into. pU£ 18 /Rising the BamHI and Hindlll*. 

site. The resulting clone -p(JC1 8CXH580 expresses a stable 

• »■ -ft . • 

and antigenic protein of.'about 25kD in size.- 
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The. following deposited plasmids, host bacteria and cell 
lines were used for the purpose of the present invention. 
The deposition was affected according to the Budapest 
treaty 





m. o . 




depository 


deDosition number 




B 95.8 




ATCC 


CRL 1612 


E 


.coli K12 


JM83 


ATCC 


35607 


E 


.coli KM 


BMH71-18 


DSM 


341 3 


E 


.coli K12 


JM109 


DSM 


3 423 




pUC8 




DSM 


3420 




P UC9 




DSM 


3421 




pUC12 




DSM 


3422 




PUC19 




DSM 


3425 




PUR288 




DSM 


3415 




p(JR290 




DSM 


3417 




pUR291 




DSM 


3418 




pUCARGS80 




DSM 


3408 




DUC18 




DSM 


3424 



While ve have hereinbefore presented a number of 
embodiments of this invention, it is apparent that our 
cons cruet ions- can be alterd to- provide other embodiments 
which utilize DNA sequences of . the E3V genome coding 
for Z3V-raiatad antigens and for producing recombinant 
DMA molecules. It is obvious to those skilled in the art 
that other DNA sequences may also be used, which are 
related to said DNA sequences and which aav be derived 
from other EBV serotypes. The 13 V is easily obtainable 
from known natural sources, e. g. from the saliva of 
infected oatients. 



It is obvious that for obtaininc biolocicalW 
comparable results other suitable vector/host svstams 

■m 

can be used. The Invention 1st not limited to host/vector 
systems presently available. 
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Claims : " 

» * m * 

1 . A DMA sequence of the EBV-genome, characterized in 
that it corresponds to. at least a part of an EBV-re- 
lated antigenic protein, having an aminoacid sequence 
as shown in Figures 3, 17 , and 28. 

2. A DNA sequence according to claim 1, - characterized 
in that it corresponds to at least a part 

of .protein p150, p143, p138, pUO, p105, 

p90, p80, p54 or gp250/-3-50. 

* • ^* 

3. A DNA sequence according to claims 1 or 2^ 
characterized in that it"-conta*ins additionally the 
respective regulatory ' sequences in the 5 1 and 3' 

flanks. . 

* . 

t 

*^ 

/• ■ * 

* — . 

4 . A DNA sequence hybridizing to a DNA sequence accor- 

■*■ . *.- 

ding to anyone of claims 1 to 3 from whatever source 
obtained including natural, synthetic or semisynthe- 
tic sources, which is related by mutations, including 
nucleotide substitutions^ 

nucleotide insertions and; inversions of nucleotide^ 
stretches to a. DNA sequence according to claims 1 

to 3 and which encodes .at least a part of a protein 

.»■ . - 

according to claim 1. 

5 . A DNA sequence according, to claim 2, characterized 

in that it is inserted"' in" the recombinant plasmid 
pUC6130, pCJC635, p(JCP40<) : , pUCP380, pUCP600, pCJCP210, 
p(JCP750, pCJCP540, pCJCHP-/ pUC924, pMF924, pKK378, 
pUR600, : pUR540, pUCARG6$0 or pUCARGTT40\ • 

1 

° .' A DNA sequence according .to claim 2 , 'characterized 

■ * 

in that it is inserted in the recombinant plasmid 
pUCLP1.9, pCJRLPI.9, pUCI 9LEP600 , pUC19LXP390, 
OURLXP390, pUCARG1230,pUCLEP600, p(JCLXP390 and pURLEP600. 

J 
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7. 



in D l:T nCe . aCCOrding t0 CUim 2 ' ^-terized 
« that lt is inserted in the recombinant plasmid 

PDM90CXH580. PUR290 OBX3 2 0, p UR292DBB18 0 ^ " o^TTOO 
pURDTT740 , P UR 290 DTP680 or P UR 2 88DPP 32 0 . R29 ° DTT7 °°' 

8 . A DMA sequence characterized in that in contains in 
reading frame at least two regions of a DMA sequence 
of anyone of claims 1 to 4 derived from a single 
EBV genome, 

9. A DMA sequence according to claim 8 , characterized 
in that it contains in reading frame- at least two 
regions of a DNA sequence of anyone of claims 1 to 
4 derived from different EBV genomes. 

10 : A DNA . se ? ue . n 5 e ..according to anyone of claims 1 to 9 
characterized in that it contains at its 3 ' end 
three to fifteen arginine codons positioned in the 
correct reading frame followed by at least one stop 

cp.don-r-",. 

• A DNA sequence according to anyone of claims i to 6, 
-:^ ara ? teri2e d"in-that it contains at its 5' end 

an oligonucl.otida encoding an oligopeptide which serves 
in the resulting .. polypeptide as a cleavage site for a 
sequence specific protease or which is cleavable by 
add treatment with an acid such as formic acid. 

_ A recombinant DNA molecule for cloning, characte- 

-_^ ized . in . that it co ntains ^ pNA sequence according 
to anyone of claims 1 to. 1 1 . 

* • WW 

A recombinant" DNA molecule.for expression, 
characterized in that it contains a DNA sequence 
. according- to anyone of the.„claims 1 to n that 
-is operatively linked to an expresssion control 
sequence. 



* 



A recombinant DNA molecule" according to claim 13, 
•characterized in that" the expression control 
sequence is selected from the group of the 
E. coli ^.promoter system/: the E. coli lac-system, 
the E". coli S-lactamase--sy.stem, the-E. : -coli 
trp-system, the E. coli lipoprotein promoter, 
yeasts and other eukaryotic expression control 
sequences . 



5 . Vector carrying a part of- the p138 encoding 

DNA^sequence the encoded- protein of which stabi- 
lizes in a fusion protein, a protein encoded by 
a DNA sequence ligated'to.'.its 3' -end and carrying 
a. DNA sequence encoding three to fifteen arginine 
residues followed by at. 1'east one stop codon 
which after insertion of £he second DNA sequence is 
positioned at the 3 1 -end. df' this second" sequence 

in the correct reading frame.,. ■ - ' 

— •.* * - ** 

. . * . ■ 

6. - Vector according, to claim.- 1 5 which, is pCJCARG601. 

7. A host, characterize^ in^that it is__trans formed 
by at least one recombinant DNA molecule according 
to anyone of claims 1 2 to "I 4. • 



A host according to claim 17- selected from the 
group consisting of strains of E. coli, othejr 
bacteria, yeasts ,- other fungi, animal. 'and human 
cells. 



— — — — — - - - — ii a / - *- 



• ^protein" having *EBV-related" antigenic determinants 
suitable ' for" diaghosi&' knd"" therapy of "EBV-related 
diseases, characteri*zed~ih that it- is 'encoded by 
a DNA sequence according:* to anybne~of*~ claims 1 
to 11 . 
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10 . A polyantigen having at least two EBV-related 
antigenic determinants suitable for diagnosis 
.and therapy of EBV-related diseases, characte- 
rized in that it is encoded by a DMA sequence 
according to anyone of claims 8 and 9 . 

A fusion protein, characterised in that it con- 
tains a protein according to claims 19 or 20 . 

2. ... A diagnostic composition for the detection of 
anti-EBV-antibodies, containing at least one 
protein according to anyone of claims 19 to 21 
in an amount sufficient to bind said anti-EBV- 
antibodies in a sample. 

23- A diagnostic composition for the detection of 
EBV-related diseases, containing at least one 
DNA sequence according to anyone of claims 1 to 
- 11 in an amount sufficient for hybridization 
to an EBV-related DNA sequence in a sample. 

4. A pharmaceutical composition containing at least 
one protein according to anyone of claims 19 
to 21 in an amount sufficient for stimulating 
in humans the production of antibodies to EBV 
and a pharmaceutically acceptable carrier or 
diluent . 

. A method of preventing EBV infection or therapy 
of EBV-related diseases comprising administering 
to a human being the pharmaceutical composition 
according to claim 24 in an amount sufficient 
to induce or to modulate an immunoresponse. 
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mononucleosis and NPC. 
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Figure 2 

Mapping of mRNA's relative to the EBV B95-8 genome 
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121 



Ficpire 3 *" 1 

GITTGCGAGGCTGGGCGGCAIGCCAAGAICGCTGAGACGT^ ^ 
+ f * " 

CCtGGCCAGCCTGACTGACTTCCTGAAATIOT ^ 

TGAIGAGTAAAG XGTAACATI ~ AATGTGGG^ 130 



13 



o 1 




d 1 



301 



361 



431 



541 




60 1 



721 



731 



240 



CflGCTACAlCTACTTTIACCCCCTOa^ ^ 



360 



420 



430 



40 



00 



'3 0 



0 



ACTCMCAacaaAAACCTQGsaGCCAttccAtcsTtAAAACAQaAMTAacM ? _ o 




730 



840 



r J CTGGC5CAGGCr J CTGCGCr,TCCr 1 *ca<3G^ 

aAacwMaaACCAascc^^ 
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nS*r LeuGlnAspGlnA laLys'Ja LAlaLy-sVa lAlaPpaLeuLysGiuPheP r oA laS* 

AACCATCAGTCACCCrjGACTCoGG^GCCTt^ATGMlTGtGGACA'oCGCGGCAIGCGAGCT 

961 + + + + 

r thr UeSerHi3?roA3p5erGlyAl.jLe'jirtet IleValAspSepAlaAiaCysGluLe 

GGC3GTf3AGCTACGCACCCGCCA XGCIGGAijGCCTCGC ACGAGACCCCGGCCAGCCTCAA 

1021 * * * " r " 

uAlaVa ISerXyp AiaPpaAlaMetLeuGluAlaSarHisGluXhp ppoAiaSepLauAs 

CXACGACXCGXGGCCCCXGXX TGCCGACXGXGAGGGXCCAGAGGCCCGXGXGGCXGCG 7X 

1031 + + + + 

nXvpAspSepXpoPpoLeuPheAlaAsoCvsGlijGiyProGlijAlaAr^ValAlaAlaLe 

ACACCGAXAtnAXGCC AGCCTGGCCCCCCACGTGTCCACGCAGAICTXTGCCACCAATTC 

1141 + + + * + + 

ijHisAP3ly pAsnAlaSerLeuAlaPpoHisValSerXhrGlnllePheAlalhpAsnSe 

CGXCCXCXACGXCXCGGGGGXCXCG AAGXCAACCGGXCAGGGCAAGGAGAGXCXCXXXAA 

1201 * + + * + * 

pValLeuXvpVaiSepGlyVa ISer LysSepXhrGlyGlnGly LysGluSer LeuPheAs 

PstI 

CAGXXXCXACAXGACCCACGGCCXGGGGACCCXGCAGGAGGGGACCXGGGACCCCXGCCG 

1261 + + + + ^ * 

nS*p?heXypr1etXhpHisGlyLauGly XhpLeuGlnGluGlyXhpXrpAs pProCysAr 

a 

CCGACCCXGCXXCXCGGGCXGGGGXGGGCCAGACGXGACCGGAACCAACGGXCCGGGAAA 

1321 : + r~* + + 

3AP3Ppoeys?heSerGlyXppGlyGlyPraAspVaiXhrSiyXhr AsnGly ProGIyAs 

CXACGCXGXGGAGCACCXGGTCXAXGCGGCCXCCXXCXCGCCCAACCXXCTXGCCCGCXA 

1331 + + + + ' — + ' r 

nXypAlaUalGluHisLeuValXypAlaAlaSep PheSer ProAsnLeuLeu AlaApqXy 

P*tl S*tl 
._ iGCCXACXACCXGCAGXXXXGCCAGGGACAGAAGAGCXCXCXGACCCCuGTGCCGGAGAC 

1441 + + -+ + + ^ 

pAlaXyrXypLeuGlnPheCysGInGlyGlnLysSerSerLeuIhrProValProGluXh 

GGGCAGCXACGXGGCGGGGGCGGCCGCCAGXCCCAXGXGCXCGCXCXGCGAGSGCCGGGC — 

1301 + + * * * * 

pGlySer XypValAlaGlyAlaAlaAlaSepPr oHetCysSer LeuCysGiuGIyApaA 1 

a 

CCCGGCCGXGXGCCXGAACACGCXCXXCXXXAGGCXGAGGGACCGCXXCCCCCCCGXCAX 

L561 + + * + * * 

aProA1.3MalCysLeuA«f.XhrLe«jPhePheAr3LeuAP3AspAr3?heProProWaiMe 

GXCCACGCAGCGGAGGGACCCCXAXGXGAXCXCGGGGGCCXCGGGCXCCXACAACGAGAC 

1421 + * * " *" * - 

tSer xnpGlnAP3Ap3AspProXypyal IleSerGlyAlaSerGiySepXy p Asr.GLuXh 

GGACXXXXXGGGCAACXXXCXCAACXXCAXCGAXAAGGAGGACGACGGGCAGCGGCCGGA 

1631 + + + + * 7 * 

c A s p P h e L e»jG 1 y A s n P he L e«j A s n P h e 1 1 a As p L y s G 1 y A a p A s p G 1 y G 1 n A p 3 f r o A s 

CGACGAGCCCCGCXACACCXACXGGCAGCXGAACCAGAACCXGCXGGAGCGGCXGXCXCG- 

1741 * ; + m + + '"' 

pAspGluProAP3XypXhpXyp Ip pGlnLe»jAsnGlriAsnteuLe*jGl<jAP3LeuS*ipAP ^ 

GCXGGGC AXAGACGCXGAAGGAAAGCTAGAGAAGGAGCCCCAXGGCCCGCGXGACXXXGX 

1301 ♦ + * * 

3La«jGly HeAspAlaGlijGlyLysLe«jGi'jLysi31iiProHisillyProAr3^spPheVa 



1020 



1030 



1200 



1260 
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1440 



1500 



1560 



1-520 



1680 



1740 



1300. 
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1361 



1930 



t o 



2040 



1"? 31 



2100 



2041 



2101 



CAAGAXGXXCAA3aACGXGGAXGCGGC3G73GACGC^ ,,20 

CAXGGCCAAGAACAACATCACCTACAAG3ACCTGG^ 
^H^A^U^T^^ 

gxacxc3X3Caaccccxxxgcgcagc^ 

CXCACXGCX3ACCAXCcl^3GACAXCXCCCX^ 

C^ ACCCCGGGCXXG3CC AGAGCCCCCCAGAGXGaCX AAAQG^ ^ 
^^oG^U^^^ 

C ACCAA«ttAaa*accTGaccATca*CA^ „ 30 

2161 i™;;;;;;;;;^;:^ 

ggx3Cax8G3gagcccaccxgc3accx3Cca8accx3Gac3CG^ 223o 
«Acaacc3aca6CMCCT8T8cacAtaTCCAAG8TGCtG«6CMTM M40 

C AAGAXCAA3AACAGG3XGGXCXXCACGG3GGA6AAXGCCGCC^ 

2341 i^rim^Ar^ 

caagxccacxaccaggagggagaacxacaxc axcaacgggcccxacaxga^ 24S0 

2401 i:;:^;^^^*!^ 

CACCXACCACAAGACCCXAXTCCC3GACACXAAGCICXCAA3CCTGX^ ^ 

CTT-ICCAG8C86CaCtCaGTCCCtGTCCCCA8C88QGCCA6CeCQGA^ „ 3<J 
CCt8GCCCTCtTT8taaAC8G866CSCCCG0QCCCAC8AAGASAGCAACatC« ^ 

;i;:au:;^u^ 



252L 



2531 



2641 



2701 



GGXGCCX3GCAACCX3GXCACTIACGCCAA3CAGA3G^^ 
rJIIJroaWAwZ^lKhri^AUL^"^ 

axacaaccAGACCCAaxxcxACAXCAGCCtaAXTCAGGGACXGGiacc^ 27M 
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' J 'J T '3 'J C *j i— O 'J ! J i 'i ri o ;nLuLUwfi^'j tnL i ' j 'i u > rf n L u U 1 J 'J '3 l 'J '2 iO'JM 1 ] o^ti']^ ; a ♦ J 

2761 + * - 

rVaLProAi aAr^A-spIyr ? r a K i s V a 1 L * *j G 1 y X h r A r 3 A i a V a 1 G 1 ■ j S e r A 1 3 A 1 a A I 

CTACoCGGAGGCCACCTCCtCCCTtACtGCaACCACijoXGGTCT^CijCGiiCCACAGACiG 

2321 + + + : * 

3 X y p A i aG 1 u A 1 a I h r 5 -3 r 3 * r X hp A 1 3 Xhr Th p « a i V -3 1 Cy s A 1 a A i a I h r A 3 pC y 

TCXXAGCCAGGXCXGCAAGGCCCGXCCGGXXGXCACGCTGCCAGIGACCAiCAACAAGXA 

2381 + + * + 7* * 

3 L.3»j£erG inVa ICys Lys AlaAroPro^ a 1 Va IXIip Leu? roValThr II sA-snLysXy 

CACGGGGGXCAACGGCAACAACCAGAXAX2CCAGGCCGGGAACCTGGGAXACTTXAXGGG 

Q ^ 1 ; ■ • — — — -r — — 

r XhpGlvVal A* nGIvA-snA-snGln IlePheGlnAl aGlyAsnLeuGly XyrPheHetGl 

CCGGGGCGTGGACAGGAACCXGCXGCAGGCCCCC3GGGCXGGGCXGCGCAAGCAGGCCGG 

3001 p * * - 

v ApqG iyVa 1 A-soApoAsnLduLeuG InAlaPpoGlvA laGlvLeuArgLysGlnAiaGl 

GGGCXCTXCCAXGCGGAAG AAGXXXGXCXXXGCCACCCCCACCCXAGGGXTGACCGXGAA 

3061 + > + + 

■/GlySepSeprtetApgLysLysPheValPheAlaXhr ProXhrLauGlyLeuIhryalLy 

GCGCC3GACCC A AGCCGCGACC AC A XAXG AG A XXG AG A AC A XCAGGGCXGGCCXGGAGGC 

2121 ♦ + * * 

sAr^ArglhrGlnAlaAlaXhrXfcrXyrGiu IleGluAsn IleApgA laG lyLeuGluAl 

C AXXAX AXCACAAAAACAGGAGGAAGACXGXGXGXXXGAXGXGGTGXGC AACCTXGXGGA 

3131 f + * 

a II* tleSerGlnLy^GlriGltjGl'jA^pCy-s'JalPheAspyalValCysA^nLa'jValAs 

XGCCAXGGGCGAGGCAXGCGCCXCGCXGACXAGGGACGACGCGGAGTACXXAXTGGGCCG 

3241 * + + + 

""" pAl.BrtetGlyGluAlaCysAlaSepLeuXhpAp^AspAspAlaGiuIypLeuLeuGlyAp 

CXXCXCCGXCCXGGCGGACAGCGXCCXAGAAACCCXGGCGACCAXXGCCXCCAGCGGGAI 

3301 + + + + + + 

3PheSerVa !L**jAlaAs pSer ValLeuGluX^p LauAiaXhr IlaAIaS^rSepGly II 

AGAGXGGACGGCGGAGGCCGCXCGGGACXXXCIGGAGGGAGXGXGGGGTGGGCCCGGGGC 

3361- * * * 

sGluXr pXhrAlaGLuAiaAlaAr3Asp?heLeuGlu»3IyyaiXi:pGiyGIy?PoGIy.Ai 

AGCCCAGGACAACXXXAXCAGCGXGGCCGAGCCGGXCAGCACCGCGXCGCAGGCCXCGGC 

3421 * + + + 

aAlaGlnAspAsnPhe IleS-»r^al A 1 aGluPpoVaiSer XhpA laSepGlnAlaSepAi 

CGGGCXGCXGCXGGGXGGAGGAGGGCAGGGCXCCGGGGGCAGACGCAAGCGCCGXCXGGC 

3481 > + * * * 

aGlyLeuLeuLeuGlyG LyGlyGlyGlnG LySepGlyG IyApgAr3LysAroAP3us<jAl 

Xho I 

CACCGXXCXCCCCGGACXCGAGGXCTAGAGACCCCTGGGGCGGCGAXGXCGGGGCXGCXG 

3541 + + * * * * 

aXhpYalLeuProGly-e'jGluValEnd 

GCGGCGGCGXACAGCCAGGXGXACGCCCXGGCGGXXGAGCXGAGCGXGXGCACCCGGCXG 
3601 + + + + + * 

GACCCCCGGAGXCXGGACGXGGCXGCGGXGGXGCGCAACGCCGGCCXGCXGGCCGAGCXG 
3 661 -* r- * * + * * 
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la^GGCCrtTCCTCCTICCCCaiTIGAGACoGCAGAAtGHCCaTGCATGCAGCGCCCIGICC 



CTGG AGCIGGiGCACCTGCTAGhG AACTCG AGAGAGGCCTCTGCCGCGC ib'- I'-'jui-w^ i 
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Figure 4: 

Restriction map of the plasmids pUC635 and p(JC6130. 
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Figure 5: 

Expression of the.p138 fusion protein encoded by pUC635, 
pUC924, pMF924, and pKK378 
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Figure 6: 

Restriction map of the plasmid pUC924 
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Figure 7: 

Restriction map of the plasmid pMF924. 
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Figure 8: 

Restriction map of the plasraid pKK378 
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Figure 9:_ 

Secondary structures of pi 38. 
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Figure 1$: 

Expression products of bacteria transformed with 
the pUR-carrying PstI fragments of p138. 
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Figure 1 1: 

Expression products of bacteria transformed with the 
P (JC subclones carrying PstI- fragments of p138. 
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Figure 12: 

Construction scheme for pUCARG1140 encoding both 

antigenic sites found by expression as B-gal fusion proteins 
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* oltgo-arg Maker 




5* 3 
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Figure 13; 

IPTG-induced expression of the plasmids pUC600, pUC601 , 
PUCARG601 and pUCARGTT40 with pUC8 as a control 
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Figure 14; 

Distribution and reactivity of the IgG and IgA 
antibodies of individual NPC-sera against the two 
epitopes detected in p138 
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Figure 15; 

ELISA test using the protein encoded by pUCARG1140 as antigen 
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Figure 16: 

Purification of proteins carrying oligo-arginine 
peptides at their carboxy- terminus . 
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Figure 17 

GCATCCGAAAAACTGGTCTATGGCTCGTGTGTCGATGCGCTGAAACCAACCGCAACAAAT 
I » * » + 60 

TACTTACCTTGTTGTTGTGTGATGGGTAAAAACACACATCACACACTTAGGCCATAGGGA 
61 - - — "* - " 120 

TGCTCACCGTAGCCGCGGCTCCAATCGCTTGAAGAAGTGTTCTTAGATCTAGTGGAAACC 
121 -* * * * * "* l 90 

TGCGGAGAATGGCTTCTCGCCCAGGGAGATCCGGCTGGGGTGGGAGCATGGGTCGTGCTG 
iax ♦ * — * ♦ - — 240 

GAGCTGACCCACCGGCATCATGATCGACCCGCTTTCTCTTCGTACCCTTCTGGGCCGGCT 
24i » * * * * 300 

CCAGGTGGGCATCTTCTGCTTCCTTTTCTGAGCTGCTATCTGATAACTCTATGAGGACAT 
301 * * 360 

TTTCCCAATCTCCCGCCGATACCTGTTCCTGCACAACCGAGGTAGATGGGACTTCTTCTT 
361 * - ~ * 4 20 

CCATGTTGTCATCCAGGGCCGGGGGACCCGGCCTGTCCTTGTCCATTTTGTCTGCAACAA 
42 L — * ^ ▼ f 480 



AAGTGTGACTCACCAACACCGCACCCCCCTTGTACCTATTAAAGAGGATGCTGCCTAGAA 
4fll ♦ * - - S40 

ATCGGTGCCGAGACAATGGAGGCAGCCTTGCTTGTGTGTCAG7ACACCATCCAGAGCCTG 

5 41 _--___----r * ~ -* * - — -* 600 

MetGl uA 1 aA 1 aLeuLauVa I CysGt nTyrThr I leGlnS«rt_eu 

Ecofll 

ATCCATCTCACGGGTGAAGATCCTGGTTTTTTCAATGTTGAGATTCCGGAATTCCCATTT 

SOi . " — — — 660 

I 1 eHlsLeuThrGl yGluAspProGl yPhePneAsnVa I Glul 1 eProGl uPheProPtie 

TACCCCACATGCAATGTTTGCACGGCAGATGTCAATGTAACTATCAATTTCGATGTCGGG 

661 * - * * * 720 

TyrProThrCysAarWa 1 CysThrA > aAspva I AsnVa I Thrl 1 eAsnPheAspVa IG1 y 

GGCAAAAAGCATCAACTTGATCTTGACTTTGGCCAGCTGACACCCCATACGAAGGCTGTC 

721 * " * 7 80 

G 1 yCysCysrtl sGl nLauAspi-euAsoPhaG \ yG I nt-auThrProHl aThrUyaA 1 aVa I 

TACCAACCTCGAGGTGCATTTGGTGGCTCAGAAAATGCCACCAATCTCTTTCTACTGGAG " 

7Qi + - — * * * 340 

TyrGlnProArgGtyA laPheGI yGl ySarGI uAsnA I aThrAsnLauPhaLauLeuGl u 

Hlnalll 

CTCCTTGGTGCAGGAGAATTGGCTCTAACTATGCGGTCTAAGAAGCTTCCAATTAACGTC 

84l - - ---- — * 900 

LauCauGl yA 1 aGt yG1 uLeuA l aLeuThrM* tArgSartysuysLeuProI 1 eAanVa \ 

ACCACCGGAGAGGAGCAACAAGTAAGCCTGGAATCTGTAGATGTCTACTTTCAAGATGTG 

901 — ♦ - - — ' 960 

TnrThrG 1 yG 1 uG 1 uG 1 nG 1 nva 1 Sart-auG 1 uSarVa 1 Aspva I TyrPheG 1 nAsoVa I 

TTTGGAACCATGTGGTGCCACCATGCAGAAATGCAAAACCCCGTGTACCTGATACCAGAA 

961 * ~* 1020 

PnaG I y ThrMe t TroCy sH 1 sri 1 sA I aG 1 uM« tG 1 nAsnProVa I Ty rteul I eProGiu 

ACAGTGCCATACATAAAGTGGGATAACTGTAATTCTACCAArATAACGGCAGTAGTGAGG 
1021 - — * 1080 
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Thrval ProTyrI I eLysTrpAspAanCysAsnSerThrAanl 1 eTlirA 1 aVa I Va I Arg 

GCACAGGGGCTGGATGTCACGCTACCCTTAAGTTTGCCAACGTCAGCTCAAGACTCGAAT 

1081 ♦ ♦ * + * ♦ H40 

A I aG I nG 1 yLauAspVa 1 ThrLeuPraLauSarLauProThrSerA 1 aGl nAspSarAsn 

TTCAGCGTAAAAACACAAATGCTCGGTAATGAGATAGATATTGAGTGTATTATGGAGGAT 
1141 * * * ♦ ♦ + 12 qq 

PhaSarVa 1 LyaThrG! uMatLeuG 1 yAanG 1 ul 1 aAspI 1 eGluCys! 1 «M«tG1 uAsp 

Pst I 

GGCGAAATTTCACAAGTTCTGCCCGGAGACAACAAATTTAACATCACCTGCAGTGGATAC 

1201 ♦ + 1260 

GlyGluI 1 eSarG I nVa 1 LouProG I yAapAancy sPheAsnl I e ThrCy aSerG 1 y Ty r 

GAGAGCCATGTTCCCAGCGGCGGAATTCTCACATCAACGAGTCCCGTGGCCACCCCAATA 

1261 * * <* * * * 1320 

G1 uSerHl aVa I ProSarGl yGl y I 1 eLeuThrSarThrSarProVa 1 A 1 aThrPro! t e 

CCTGGTACAGGGTATGCATACAGCCTGCGTCTGACACCACGTCCAGTGTCACGATTTCTT 

1321 * ♦ * * * 1380 

ProG I y ThrG 1 yTyrA 1 aTyrSarLauArgLeuThrProArgProVa 1 SarArgPnaUeu 

GGCAATAACAGTATCCTGTACGTGTTTTACTCTGGGAATGGACCGAAGGCGAGCGGGGGA 

1381 » ♦ * • * * 1440 

G1 yAanAanSarl 1 aLauTyrVa I PheTyrSarG t yAanG I yProLyaA 1 aSerG 1 yGl y 

GA TTA CTGCA TTCAGTCCAACATTGTGTTCTCTGA TGAGA TTCCAGCTTCACAGGACATG 

1441 * * ♦ + > 1500 

AspTyrCyal laGlnSarAanl 1 aVa I PhaSerAapG 1 ul 1 aProA 1 aSarGl nAapMa t 

CCGACAAACACCACAGACATCACATATGTGGGTGACAATGCTACCTATTCAGTGCCAATG 

1501 * * ♦ * * » 1560 

ppoThrAsnThrThrAaoI I aThrTyrVa I G 1 yAspAsnA 1 aThrTyrSarVat ProMat 

GTCACTTCTGAGGACGCAAACTCGCCAAATGTTACAGTGACTGCCTTTTGGGCCTGGCCA 

1561 * * * - * 1620 

va 1 TnrSerGl uAspA I aAsnSarProAanVa I Thrva IThrAl aPhaTrpA 1 aTrpPro - 

AACAACACTGAAACTGACTTTAAGTGCAAATGGACTCTCACCTCCGGGACACCTTCGGGT 

1621 ♦""J'."" * * * * 1680 

AsnAanThrGl uThrAapPheLyaCysCyaTrpThruauThrSarG I yThrProSerGl y 

TGTGAAAATATTTCTGGTGCATTTGCGAGCAATCGGACATTTGACATTACTGTCTCGGGT 

1681 * * ♦ * «■ * 1740 

CyaGluAanI 1 aSarGl yAlaPhaAl aS«rAsnArgThrPh«AspI TaThrVal SarGly 

CTTGGCACGGCCCCCAAGACACTCATTATCACACGAACGGCTACCAATGCCACCACAACA 
1741 : » * * ♦ * -r 1800 

LeuG I yThrA 1 aProLysThrLaul 1 al 1 aTMrArgThrA taThrAsnA 1 aThrThrThr 

ACCCACAAGGTTATATTCTCCAAGGCACCCGAGAGCACCACCACCTCCCCTACCTTGAAT 

1801 ♦ * * * * * i860 

ThrMl sLyaVal 1 I aPriaSarLysA I aProG 1 uSarThrThrThpSarPpoThruauAsn 

ACAACTGGATTTGCTGATCCCAATACAACGACAGGTCTACCCAGCTCTACTCACGTGCCT 

1881 * * * ♦ ♦ 1920 

ThrThrGl yPMaA 1 aAapProAanTMrThrthrG 1 yLauProSarSarThrMl sVa I Pro 

ACCAACCTCACCGCACCTGCAAGCACAGGCCCCACTGTATCCACCGCGGATGTCACCAGC 

1921 * * * * * 1980 

ThrAanLauThrA 1 aProA I aSarThrG 1 yProT^rVa I SarThrA 1 aAspVa I ThrSar 

CCAACACCAGCCGGCACAACGTCAGGCGCATCACCGGTGACACCAAGTCCATCTCCATGG 
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1981 * — — * 2040 

p roThrProA ] aG i y ThrThrSerG 1 yA 1 aSerProVe 1 ThrProSerProSerProTrp 

)X 

GACAACGGCACAGAAAGTAAGGCCCCCGACATGACCAGCTCCACCTCACCAGTGACTACC 

, 0A1 ~ * " — * 2100 

AapAsnG I y ThrG 1 uSart_y sA 1 aProAspMatThr SarSarThrSar Pro va I ThrThr 

CCAACCCCAAATGCCACCAGCCCCACCCCAGCAGTGACTACCCCAACCCCAAATGCCACC 
2^q^ - — ——————— — — — — —— — ——■* — — — — — — — — — .» — — — — —— — — — ■*■— — — — — — — — — ^ — — — — — — — — — 2L60 

ProThrProAsnA \ aThrSerProTnrProA I aVa 1 ThrThrProThrProAsnA I aThr 

AGCCCCACCCCAGCAGTGACTACCCCAACCCCAAATGCCACCAGCCCCACCTTGGGAAAA 

2^g^ — — -*> — — — — — -r — — — — — — f — **^U 

SerProThrProA 1 aVa 1 ThrThrProThrProAsnA 1 aThrSarProThrt_euGl yt-ys 

ACAAGTCCTACCTCAGCAGTGACTACCCCAACCCCAAATGCCACCAGCCCCACCTTGGGA 
2221 — — — — — — — — — ■* — — — — — — — — —-* — — — — — — — — — ■* — — — — — —————— — — — — — — — — ♦ — — — — — — — — —.^ 2280 

Tnr SarProThr SerA I aVa 1 ThrThrProThrProAsnA I aThrSarProThruauG I y 

AAAACAAGCCCCACCTCAGCAGTGACTACCCCAACCCCAAATGCCACCAGCCCCACCTTG 

22Q1 * * *- — *- - 2340 

LyaThrSerProThrSerAlaValThrThpProThrProAanA 1 aThrStrProThrLau 

GGAAAAACAAGCCCCACCTCAGCAGTGACTACCCCAACCCCAAATGCCACCGGCCCTACT 

? 341 * * * * * 2400 

C I yLy sTnrSarPPoThrSerA 1 aVa l ThrThrProThrProAsnA 1 aThrG 1 y ProThf 

GTGGGAGAAACAAGTCCACAGGCAAATGCCACCAACCACAC^ ^ q 

240 1 va 1 Gl yG 1 uThrSarProG 1 nA I aAsnA I aThrAsnHI sThri-euG I yGl y ThrSarPro 

ACCCCAGTAGTTACCAGCCAACCAAAAAATGCAACCAGTGCTGTTACCACAGGCCAACAT 
24 g _ ^ _ _ _ _ — — __ — — _ — — — — — — — — .^ — — — — — — — — — ■»■ 252u 

ThrProVa 1 va I ThrSarG » nProLysAsnA I aThrSarA 1 aVa I ThrThrG 1 yG 1 nHl a 

AACATAACTTCAAGTTCAACCTCTTCCATGTCACTGAGACCCAGTTCAAACCCAGAGACA 
n S2 l - - * * 2580 

* AsnX leThrSarSarSacThrSarSarMatSarLeuArgProSarSarAsnProGiuThr 

™- - CTCAGCCCCTCCACCAGTGACAATTCAACGTCACATATGCCTTTACTAACCTCCGCTCAC ^ 

253 j. — .— — — — — — ♦ ———> — — ————♦••• — -* ~ — — ♦ — ~ — — — — — — — — — — — — — 

LeuSarProSarThrSarAspAsnSarThrSarHtaMatProLeuLauThrSarAlaHls 

CCAACAGGTGGTGAAAATATAACACAGGTGACACCAGCCTCTATCAGCACACATCATGTG 
2 6 a i —_ — — ——— ——■» — ————— - ""■~'*'*""*' — ~ - ~'™~ — — ^ — — — — — — — — — — 

ProThrGl yG 1 yG 1 uAanl 1 aThrG I nva 1 ThrProA I aSar 1 1 aSarThrtH sm 1 sva I 

TCCACCAGTTCGCCAGCACCCCGCCCAGGCACCA^ 27QQ 
2701 SarThrSarSarProAlaProArgProGlyThrThrSarGlnAlaSarGlyProGlyAan 

AGTTCCACATCCACAAAACCGGGGGAGGTTAATGTCACCAAAGGCACGCCCCCCCAAAAT 

_~~_ — — — — — — — — — — — — — — — — — — -r — — — — — — — — — -^ — — — — — — — 28ZU 

27Q1 « — — — — — — —— — — — — —— — — — T 

SarSarThrSar ThrLy sProG 1 yG 1 uVa » Aanva 1 ThrLysG 1 y ThrProProG 1 nAsn 
GCAACGTCGCCCCAGGCCCCCAGTGGCCAAAAGACGGCGGTTCCCACGGTCACCTCAACA 

282 1 ATaThrSarProGl nA t aProSarG I yGl nuyaThrA 1 aVa 1 ProThrVa I ThrSar Thr 

GGTGGAAAGCCCAATTCTACCACCGGTGGAAAGCACACCACAGGACATGGAGCCC^ 

2 q q i ——— — — — — — — — — — — — — — — — — — — — — •» — — — — — — — — — » — — ~ 

G I yG 1 yUyaA 1 aAsnSarThrTnrG 1 yG \ yLysMl a ThrThrG 1 yMl sG 1 yA 1 aArgThr 
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3001 



3061 



3121 



3181 



3241 



3301 



- 4 

AGTACAGAGCCCACCACAGATTACGGCGGTGATTCAACTACGCCAAGACCGAGATACAAT 

SarThrGluProThrThrAspTyrGlyGlyAspSarThrThrProArgProArgTyrAsn 

GCGACCACCTATCTACCTCCCAGCACTTCTAGCAAACTGCGGCCCCGCTGGACTTTTACG 

— ~ ""^ ~ — — ™'~' — — ~™"*'~*"~ — — — — — — — — ■* — — — — — ———_ ♦ 

A I aThrThrTyrLauProProSarThrSarSarLyaLauArgProArgTrpThrPriaThr 

AGCCCACCGGTTACCACAGCCCAAGCCACCGTGCCAGTCCCGCCAACGTCCCAGCCCAGA 

SarProProVa I ThrThrA 1 aG 1 nA laThrVa I ProVa I ProProTHrSmrO 1 nProArg 

Pari *•*•*••••*•••••*•.**« 

TTCTCAAACCTCTCCATGCTAGTACTGCAGTGGGCCTCTCTGGCTGTGCTGACCCTTCTG 
• • tm JCt * nU " uS#rM * tL * uV » 1 u « uG 1 "TrpA I aSaruauA 1 aVa 1 LauThruauLaC 
CTGCTGCTGGTCATGGCGGACTGCGCCTTTAGGCG T AACTTGTCTACATCCCATACCTAC 
LauLauLeuVa t Mat A I aAapCyaA I aPh«ArgArgAsnLauS«rThrS«rh1 sThrTyr 

ACCACCCCACCATATGATGACGCCGAGACCTATGTATAAAGTCAATAAAAATTTATTAAT 
ThrThrProProTy rAapAapA 1 aG I uTnrTyrVa I Ena 

CAGAAATTTGCACTTTCTTTGCTTCACGTCCCCGGGAGCGGGAGCGGGCACGTCGGGTGG 



3000 
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3120 



3180 



3240 



3300 



3360 



CGTTGGGGTCGTTTGATTCTCGTGGTCGTGTTCCCTCACC 
3361 + * «. 3400 
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Figure 20 
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Figure 21 -1- 

ACCATGATTACGGATTCACTGGCCGTCGTTTTACAACGTCGTGACTGGGAAAACCCTGGC 
_________._.__..__. — ^ — —j — — — — —— — ^ — — — — — — — -• — -»-- — -- — — -•• — — — "♦ — — -* — — — — - 

M«tl t aThrAspSerLeuAl aVa I Va I LauGl nArgArgAspTrpG 1 uAsnProGI y 

GTTACCCAACTTAATCGCCTTGCAGCACATCCCCCTTTCGCCAGCTGGCGTAATAGCGAA 

____,_,— _»,—.,_>,-,.— — —, _••* — —• — —»-»•—<■—♦—.— — — — — — — — ^ — ^» — — 

Va I ThrG 1 nLeoAsnArgLeuA 1 aA 1 ai-H sProProPnaA 1 aSarTrpArgAanSerG 1 u 

GAGGCCCGCACCGATCGCCCTTCCCAACAGTTGCGCAGCCTGAATGGCGAATGGCGCTTT 

___«» — — — — ♦ — — — — — — — — -•-* — — — — — — — — '—■*-• — — — — — — — — ^* 

Gl uA 1 aArgThrAspArgProSarGlnGlnLeuArgSarLauAanGl yGI uTrpArgPne 

GCCTGGTTTCCGGCACCAGAAGCGGTGCCGGAAAGCTGGCTGGAGTGCGATCTTCCTGAG 
________, — _ — — — — — .__. — .^ — — .. — — — — — — -*— — — — — — — — — -» — — — — — -» — — — — — — — — — — — 

A 1 aTrpPneProA I aProG 1 uA 1 aVa I ProG I uSarTrpueuGI uCysAapcauProG I u 

GCCGATACTGTCGTCGTCCCCTCAAACTGGCAGATGCACGGTTACGATGCGCCCATCTAC 
______________ _ _ ,^ — — — — — _ — — — + — — — — — — — — — •» — — — -"•-• — — — — ♦ — — — — — — — — — 

A 1 aAspfhrVa IValVal ProSer AanTrpGl nMa tMl aGl y TyrAapA 1 aPro 1 1 eTy r 

ACCAACGTAACCTATCCCATTACGGTCAATCCGCCGTTTGTTCCCACGGAGAATCCGACG 
________ — _ — — _ — — — — — — — — — — — — — ■» — — — — — — — — —■* — — — — — — — — — ♦ — -*-* — — ■~~ ^ 

ThrAanVa 1 TnrTyrProI 1 eThrVa I AsnPPoProPneVa I ProThrGI uAanProThr 

GGTTGTTACTCGCTCACATTTAATGTTGATGAAAGCTGGCTACAGGAAGGCCAGACGCGA 

______,_ — __.____ — — — — .^ — — — — — — — — — •♦ — — — — — — — ->-—•+ — — ——— — — — — ♦ 

G I yCyaTyrSarLeuThrPheAsnVa 1 AspGluSarTrpueuG InGl uGl yGl nThrArg 

ATTATTTTTGATGGCGTTAACTCGGCGTTTCATCTGTGGTGCAACGGGCGCTGGGTCGGT 
^ ^. — — _ — — _ — — — — - — — — 

I 1 el I ePnaAaoGIyVatAsnSarAlaPnaHtaCeuTrpCysAanGlyArgTrpValGly 

TACGGCCAGGACAGTCGTTTGCCGTCTGAATTTGACCTGAGCGCATTTTTACGCGCCGGA 

_ _ _ ♦ — — — — — — — ■— — — — — — •*■ — — — — — 

Ty rG.I yG I nAapSarArgLauProSarG 1 uPhaAapLauSarA 1 aPhaLeuArgA I aGl y 

GAAAACCGCCTCGCGGTGATGGTGCTGCGTTGGAGTGACGGCAGTTATCTGGAAGATCAG 
— — — — — — — — — _ ^ — — — — — — — -f — — — — — — 

GluAanArgLauA 1 aVa 1 Ma tVa 1 CauArgTrpSarAapG I ySarTyrLauG 1 uAspGI n 

GATATGTGGCGGATGAGCGGCATTTTCCGTGACGTCTCGTTGCTGCATAAACCGACTACA 

,___—_,___ — — _ — — ^ — — — — — — — — — ^ — — — — — — — — — 

AapMatTrpArgMatSarGlyl 1 aPnaArgAapVa I SarL.«uLauHl aLyaProThrThr 

CAAATCAGCGATTTCCATGTTGCCACTCGCTTTAATGATGATTTCAGCCGCGCTGTACTG 

, __ — — — — — — — «. — — . *> <• — — ♦ — — —> ———> — —.— ♦ — — — — — — — — — ■* — —— — — — — — — •* 

G 1 nil aSarAappharM a Va 1 A 1 aThr ArgPhaAanAapAspPnaSerArgA 1 aVa \ uau 

GAGGCTGAAGTTCAGATG7GCGGCGAGTTGCGTGACTACCTACGGGTAACAGTTTCTTTA 

_ _ _ ___ > ______^_ ____, — —_ — ^ — — — —— — — — — ^ — — — — — — — — 

G 1 uAl aG I uva 1 G 1 oMa t CyaG 1 yG 1 uL«uArgAapTyrU«uArgVa 1 ThrVa 1 SerL«o 

TGGCAGGGTGAAACGCAGGTCGCCAGCGGCACCGCGCCTTTCGGCGGTGAAATTATCGAT 

_ — ____ ___^.^ — — — — — — — — — ^ — — — — — — — — — » 

TrpGlnGi y G I uTnrG 1 nVa 1 A l aSarG I y ThrA I aProPhaG I yG 1 yG 1 ul I «1 1«Aap 
GAGCGTGGTGGTTATGCCGATCGCGTCACACTACGTCTGAACGTCGAAAACCCGAAACTG 

GluArgGl yGl ytyrA I aAapArgval ThrttuArgUaoAsnVa IGluAanProt-ya-.au 
TGGAGCGCCGAAATCCCGAATCTCTATCGTGCGGTGGTTGAACTGCACACCGCCGACGGC 
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^ , * 960 

TrpSerAlaGluI l eProAsnueuTy rArgA 1 ava » Va 1 G 1 uLeuHi sThrA 1 aAspG 1 y 

ACGCTGATTGAAGCAGAAGCCTGCGATGTCGGTTTCCGCGACGTGC^ iQ2Q 

961 ThrueuneGiuAlaGluAlaCysAsovalGtyP^eArgGluValArgneGluAsnGly 

CTGCTGCTGCTGAACGGCAAGCCGTTGCTGATTCGAGGCGTTAACCGT^ iQQ0 

1021 CaCueuueCuiCAlnGWLyaProLauLauItaArgGlyVa 

cctctgcatggtcaggtcatggatgagcagacgatggtgcaggatatcctgctg ii4Q 

1081 P^oLauHiaG^yGlnValM^^ 

cagaacaactttaacgccgtgcgctgttcgcattatccgaaccat^ l2QQ 

1141 QlnA^nAanPhaAa^AlaVa^Ar^^ 

ctgtgcgaccgctacggcctgtatgtggtggatgaagccaatattgaaa^ i26Q 

120 1 LauCy^lDlrgTyrGlyLeuTyrVa \ Va^AspG t uA 1 aAsnl 1 eG 1 uThrMl aG 1 yMa t 

gtgccaatgaatcgtctgaccgatgatccgcgctggctaccggcg^ 132Q 

1261 vIlProMatAlnArgileuThrAsoAsoProArgTrpLeuProA 1 aMe tSarG I uArgv/a I 

acgccaatggtgcagcgcgatcgtaatcaccccagw i38Q 

1321 ThrArgMatCa^GlnArgAsp^ 

GAATCAGGCCACGGCGCTAATCACGACGCGCTGTATCGCTGG^ 

1381 GtuSerGtyrtisGiyAl^ 

TCCCGCCCGGTGCAGTATGAACGCGGCGGAGCCGACACCACGGCCACCGAT^ 

' L4Al SarArgProvi 1 G 1 nty rG*uG 1 yG 1 yG l yA \ aAspThrThrA 1 aThrAsp! 1 ml 1 eCys 
CCGATGTACGCGCGCGTGGATGAAGACCAGCCCTTCCCGGCTGTGCCGAAA^ 

1501 ProMatTyrAlalrg^alAspGi^ .„ .. 

AAAAAATGGCTTTCGCTACCTGGAGAGACGCGCCCCCTGATCCTTTGC^ 

1561 ry^yjTrpLeuSerCauProGlyGluThrArgProLauIIeuau 

GCGATGGGTAACAGTCTTCGCGGTTTCGCTAAATACTGGCAGGCG^ l6QQ 

1621 IVa^lVyllnllr^ 

CGTTTACAGGGCGGCTTCGTCTGGGACTGGGTGGATCAG^ ^ 

1681 ArgueuGlnGWGWPneCaiTrpA^ 

AACGGCAACCCGTGGTCGGCTTACGGCGGTGATTTTGGCGATA^ ia0Q 

1741 AlnGlyAsnPro^rpSerAiaTyr^ 

TTCTCTATGAACGGTCTGGTCTTTGCCGACCGCACCCCGCAT^ ^ 

180 1 PhaCy sMe clanG I yleCva I PhaA^AapArgThrProH* .ProA 1 aLauThrG I uA 1 a 
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AAACACCAGCAGCAGTTTTTCCAGTTCCGTTTATCCGGGCAAACCATCGAAGTGACCAGC 

1861 * * * * ** * 1920 

LyaHiaGI nGlnGtnPhePhaGlnPheArgLeuSerGlyGlnThrl 1 eGl uVa 1 ThrStr 

GAATACCTGTTCCGTCATAGCGATAACGAGCTCCTGCACTGGATGGTGGCGCTGGATGGT 

1921 * * * * * * isao 

Gl uTyrLeuPheArgH 1 sS«rAspAsnG ) uL«uU«uHl aTrpMa tVa 1 A 1 aLtuAspG 1 y 

AAGCCGCTGGCAAGCGGTGAAGTGCCTCTGGATGTCGCTCCACAAGGTAAACAGTTGATT 

1981 * * * * " 2040 

LysProueuA \ aSerGl yG I uVa 1 ProLOuAspva I A 1 aProGl nG 1 yLysG I nLeu1 1 e 

GAACTGCCTGAACTACCGCAGCCGGAGAGCGCCGGGCAACTCTGGCTCACAGTACGCGTA 

2041 * * * * 2100 

G 1 uLeuProG I uLeuProGlnProGluS«rA 1 aGl yG 1 nLauTrpLeuThrVa » Argva I 

GTGCAACCGAACGCGACCGCATGGTCAGAAGCCGGGCACATCAGCGCCTGGCAGCAGTGG 

21Q1 * * * - 2160 

ValGtnProAsnA 1 aThrA 1 aTrpSerG 1 uA 1 aGl yMl si I eSer A 1 aTrpG I nG 1 nTrp 

CGTCTGGCGGAAAACCTCAGTGTGACGCTCCCCGCCGCGTCCCACGCCATCCCGCATCTG 

2161 - * * * * * 2220 

ArgLauAl aG 1 uAsnceuStrVa I ThrLauProAl aA 1 aSeri-H sA 1 a 1 1 eProHlaLeu 

ACCACCAGCGAAATGGATTTTTGCATCGAGCTGGGTAATAAGCGTTGGCAATTTAACCGC 

2221 ♦ ♦ 2280 

ThrThrSerGluMetAapPheCyal 1 eGluLeuGl yAanLyaArgTrpGI nPheAsnArg 

CAGTCAGGCTTTCTTTCACAGATGTGGATTGGCGATAAAAAACAACTGCTGACGCCGCTG 

' 2281 * * * * * * 2340 

GlnSerGI yPMeLeuSerGlnMetTrpI 1 aGl yAapLyaLyaG 1 nLeuLeuThrProl.au 

CGCGATCAGTTCACCCGTGCACCGCTGGATAACGACATTGGCGTAAGTGAAGCGACCCGC 

2341 ♦ * — ^ + 2400 

ArgAspG 1 nPneThr ArgA I aProLeuAapAanAapI 1 eGlyVal SerGluAl aTnrArg 

ATTGACCCTAACGCCTGGGTCGAACGCTGGAAGGCGGCGGGCCATTACCAGGCCGAAGCA 

2401 - * * * * * 2460 

1 1 eAapProAanAlaTrpValGluArgTrpLyaAlaAlaGlyHlaTyrGlnAlaGtuA la 

GCGTTGTTGCAGTGCACGGCAGATACACTTGCTGATGCGGTGCTGATTACGACCGCTCAC 

2461 * * * * * ~ 2520 

A 1 aLauLeuG 1 nCyaThrA 1 aAapThrLeuA 1 aAspA 1 aVa I Leu 1 1 eThrThrA I aMl a 

-i| GCGTGGCAGCATCAGGGGAAAACCTTATTTATCAGCCGGAAAACCTACCGGATTGATGGT 

2521 * * ♦ * "* 2S80 

A 1 aTrpGl nnlaGlnGl yLyaThrLeuPhal 1 eSerArgLyaThrTyrArgl 1 eAapGly 

AGTGGTCAAATGGCCATTACCGTTGATGTTGAAGTGGCGAGCGATACACCGCATCCGGCG 

. 2581 * * * * * * 2640 

SerG 1 yG » nMe t A 1 a 1 1 eThrVa I Aapva \ G 1 uVa 1 A I aSerAapThrProHl aProA 1 a 

CGGATTGGCCTGAACTGCCAGCTGGCGCAGGTAGCAGAGCGGGTAAACTGGCTCGGATTA 

2641 - * * --- 2700 

Arg £ 1 eG 1 yLauAanCy aG 1 nceuA t aG 1 oVa I A I aG 1 uArgVa 1 AanTrpLeuG 1 yLeu 

GGGCCGCAAGAAAACTATCCCGACCGCCTTACTGCCGCCTGTTTTGACCGCTGGGATCTG 

2701 ~ — — * 2760 

Gl yProGInGluAanTyrProAapArgLeuThrAlaA I aCyaPheAapArgTrpAapLeu 

CCATTGTCAGACATGTATACCCCGTACGTCTTCCCGACCGAAAACGGTCTGCGCTGCGGG 

2761 — " " 2 a 20 

ProLeuSerAapMerTyrTnrProTyrVa I PneProSerG luAanG 1 yLauArgCy aGl y 
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ACGCGCCAATTGAATTATGGCCCAOCCAGTGGCGCGGCGACTTCCAGTTCAACATCAGC 

2821 — * — ♦ — ♦ • — " — " * 2880 

ThrArgGluCauA»nTyrGlyProHl»GlnTrpArgGlyAapP*aG1nPhaAanIlaSar 

CGCTACAGTCAACAGCAACTGATGGAAACCAGCCATCGCCATCTGCTGCACGCGGAAGAA 

288 1 — — — — — — — — — — - — — • — 2940 

ArgTyrS«rGlnG1nG»nL«uM«tG1uThrS«rM1sArgrt1aU«uL«uM1»Al«G1uGlu 

GGCACATGGCTGAATATCGACGGTTTCCATATGGGGATTGGTGGCGACGACTCCTGGAGC 

2941 — * ♦ — ♦ — * 3000 

G 1 y TnrTrpLauAan I 1 aAapG 1 y Pn«H1 sMa tG I y 1 1 aGI yGl yAapAapSar TrpSar 

CCGTCAGTATCGGCGGAATTCCAGCTGAGCGCCGGTCGCTACCATTACCAGTTGGTCTGG 

^» ^* ^» ^B ^B- ** •* •* ** ^* ^B ^B ^B< ^B ^B BY ^B ^B •* ^* ^* "** ^P ^ m 4b t» %* ^B* 

Pro5«rVa 1 SarA 1 aGI uPnaG 1 nL«uS«rA 1 aGI yArgTyrHI aTyrGl nLauVa 1 Trp 
TGTCAAAAAgggg*tccgtcg«cctgc»GTGGATACGAGAGCCATGTTCCCAGC^CG^ 

2 ^™ J 1 • ^B ^B ^B ^B BB. BB BB *^ ^B BB b» ^B bb BB BB BB bb BBj BB- bb bb BB BB BB BB bb BB BB BB BB B* *B BB B* *B BB BB) ^B BB BB BB BB BB> <3 4 bb> \# - 

Cy sGI nCyaGl V AapProSarT*rCyaSarG1 y TyrGI uSarMl aVa I ProS-rG 1 yCl y 

ATTCTCACATCAACGAGTCCCGTGGCCACCCCAATACCTGGTACAGGGTATGCATACAGC 

3l2 * * * * * * * 3180 

1 1 al.auThrSarThrSarPo.oVa » A 1 aThrProI 1 aProG I yThrGI yTyrA 1 aTyrSar 

CTGCGTCTGACACCACGTCCAGTGTCACGATTTCTTGGCAATAACAGTATCCTGTACGTG 

31fll * * * * * * 3240 

LauArguauThrProArgProva 1 SarArgPhauauG 1 yAanAanSarl I aLauTy rVa I 

TTTTACTCTGGGAATGGACCGAAGGCGAGCGGGGGAGATTACTGCATTCAGTCCAACATT 

3241 * * * * * * 3300 

PhaTyrSarG I yAsrtGl yProCyaA 1 aSarGI yGl yAapTyrCyal 1 aG InSarAanI 1 a 

GTGTTCTCTGATGAGATTCCAGCTTCACAGGACATGCCGACAAACACCACAGACATCACA 

3301 ♦ * * * * * 3360 

Va 1 PhaSarAapGJ ul-l aProA t aSarGI nAapMatProThrAanThrThrAapI 1 aThr 

TATGTGGGTGACAATGCTACCTATTCAGTGCCAATGGTCACTTCTGAGGACCCAAACTCG 

33*1 — -«> — ♦ 3420 

Ty rva \ G 1 yAapAanA \ aThrTy rSarVa I ProfctatVa I ThrSarGl uAapA I aAanSar 

CCAAATGTTACAGTGACTGCCTTTTGGGCCTGGCCAAACAACACTGAAACTGACTTTAAG 

3421 * * *- 3480 

ProAanva I Thrva 1 ThrA 1 aPnaTrpA 1 aTrpProAanAanThrGluThrAapPfiaUya 

TGCAAATGGACTCTCACCTCCGGGACACCTTCGGGTTGTGAAAATATTTCTGGTGCATTT 

3481 * * * * * * 3540 

CyauyarrpThrcauThrSarGlyThrProSarGlyCyaGluAanllaSarGlyAlaPha 

GCGAGCAATCGGACATTTGACATTACTGTCTCGCGTCTTGGCACGGCCCCCAAGACACTC 

3541 * * * * * * 3600 

A 1 aSarAanArgTnrPnaAapI I aThrVa 1 SarGI yLauG \ yThrA 1 aProCyaThrLau 

ATTATCACACGAACGGCTACCAATGCCACCACAACAACCCACAAGGTTATATTCTCCAAG 
iTalTaThrArgThrA 1 aThrAanA 1 aThrThrThrThrHl aLyaVa 1 1 1 aPnaSarUya 

go^cccgagagcaccaccacctcccctaccttgaatacaactgga^tgct^tcccaat 

A i aProG 1 uSarThrThrThrSarProThrLauAanThr ThrG 1 y PtiaA 1 aAapProAan 
ACAACWCAGGTCTACCCAGCTCTACTCACGTGCCTACCAACCTCACCGCACCTGCAAGC 
3721 * * * * 3780 
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ThrThrThrG t yLauProS«rSarThrMl sVa 1 ProThrAanLauThrA I aProA 1 aSer 



ACAGGCCCCACTCTATCCACCGCGGATGTCACCAGCCCAACACCACCCGGCACAACGTCA 

3781 * + ♦ * ♦ ♦ 3840 

ThrG \ yPraThrVa I SerThrA 1 aAspVa 1 ThrSarProThrProA I aG 1 yThrThrSer 

GGCGCATCACCGGTGACACCAAGTCCATCTCCATGGGACAACGGCACAGAAAGTAAGGCC 

3841 ♦ ♦ + ♦ * 3900 

G I yA \ aSarProva \ ThrProStrProSarProTrpAspAanGI yThrGI uSartysA 1 a 

CCCGACATGACCAGCTCCACCTCACCAGTGACTACCCCAACCCCAAATGCCACCAGCCCC 

3901 * * * * ♦ ♦ 3960 

ProAspMatThrSarSarThrSarProVal ThrThrProThrProAsnA 1 aThrSarPro 

ACCCCAGCAGTGACTACCCCAACCCCAAATGCCACCAGCCCCACCCCAGCAGTGACTACC 

3961 «■ * * * * 4020 

ThrProA 1 ava 1 ThrThrProThrProAsnA I a ThrSarProThrProA 1 aVa 1 ThrTnr 

CCAACCCCAAATGCCACCAGCCCCACCTTGGGAAAAACAAGTCCTACCTCAGCAGTGACT 

4021 * * * <- * * 4080 

PpoThpPpoAsnA 1 aThrSarProThrLauGI yLysThrSarProThrSarA 1 aVa 1 Thr 

ACCCCAACCCCAAATGCCACCAGCCCCACCTTGGGAAAAACAAGCCCCACCTCAGCAGTG 

4081 ♦ * * » * «• 4140 

ThrProThrProAsnA 1 aThrSarProThrLauG I yt_yaThrSarProThrSarA 1 aVa \ 

ACTACCCCAACCCCAAATGCCACCAGCCCCACCTTGGGAAAAACAAGCCCCACCTCAGCA 

4141 * - * * — . * 4200 

ThrThrProThrProAsnA 1 aThrSarPro-Thrt-auGl yLysThrSarProThrSarA 1 a. 

■ GTGACTACCCCAACCCCAAATGCCACCGGCCCTACTGTGGGAGAAACAAGTCCACAGGCA 

4201 . * * ** * * ♦ 4260 

va 1 ThrThrProThrProAsnA I aThrGl yProThrva 1 Gl yG 1 uThrSerProG 1 nA 1 a 

AATGCCACCAACCACACCTTAGGAGGAACAAGTCCCACCCCAGTAGTTACCAGCCAACCA 

4261 - * * * * . * - 4320 

AsnA I aThpAsnHlsThPLeuGI yG J y ThrSarProThrProVa I Va I ThrSarG \ rxPra 

AAAAATGCAACCAGTGCTGTTACCACAGGCCAACATAACATAACTTCAAGTTCAACCTCT 

4321 ♦ * * * * 4380 

LysAsnAt aThr Sar A I aVa 1 ThrThrG 1 yGI nrll sAsnl 1 aThrSarSarSarThrSar 

TCCATGTCACTGAGACCCAGTTCAAACCCAGAGACACTCAGCCCCTCCACCAG7GACAAT 

4381 * ♦ * * * ♦ 4440 

SarMatSart.auArgProSarSarAsnProGluThrLauSarPro5arThrSarAspAsn 

TCAACGTCACATATGCCTTTACTAACCTCCGCTCACCCAACAGGTGGTGAAAATATAACA 
4441 * -r ♦ * »™-« * 4500 

SarThrSarrM sMa cProcauCOuThrSarA 1 aHt s Pro ThrG 1 yG X yGI uAsnl i tThp 

CAGGTGACACCAGCCTCTATCAGCACACATCATGTGTCCACCAGTTCGCCAGCACCCCGC 

4301 * * * * * ♦ 4560 

G I nVa 1 ThrProA I aSar I ! aSarThrrtl si-M sVa 1 SarThrSarSarProA I aProArg 

CCAGGCACCACCAGCCAAGCGTCAGGCCCTGGAAACAGTTCCACATCCACAAAACCGGGG 

4561 * » ~ * ♦ ♦ 4620 

ProQI y ThrThrSarGI nA 1 aSarG I yProGl yAsnSarSarThrSarTnruysProGI y 

GAGGTTAATGTCACCAAAGGCACGCCCCCCCAAAATGCAACGTCGCCCCAGGCCCCCAGT 
4621 ♦ * * * * *- 4680 



G I uVa 1 AsnVa 1 ThrLysGl yThrProProGl nAsnA 1 aThrSarProGl nA 1 aPPoSar 
GGCCAAAAGACGGCGGTTCCCACGGTCACCTCAACAGGTGGAAAGGCCAATTCTACCACC 
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4681 * * ♦ * * * 4740 

G1 yd nLyaThrA t aVa 1 ProThrVa 1 ThrSerTUrG 1 yG 1 yLysA \ aAsnSarThrThr 



4741 



4861 



4-921 



GGTGGAAAGCACACCACAGGACATGGAGCCCGGACAAGTACAGAGCCCACCACAGATTAC 
_ _ _ _ — — — — . ■*■ — — — — — — — — — — -*• 

Gl yGl yLysHi sThrThrGl yrM sGI yA 1 aArg ThrSerTUrG 1 uProThrThrAspTyp 



ACTTCTAGCAAACTGCGGCCCCGCTGGACTTTTACGAGCCCACCGGTTACCACAGCCCAA 
^ _ ^ .» _ _ _ — — — — — — — — — — — — ? 

ThrS#rSerUysLeuArgProArgTrpThrPhaThrS«rProProValThrThrA laGln 

GCCACCGTGCCAGTCCCGCCAACGTCCCAGCCCAGATTCTCAAACCTCTCCATGCTAGTA 
_ — ^ _ + — — — _ — — - ^ — -t- 

A I aThrVa I ProVa I ProProThrSerG 1 nProArgPheS«pAanLauSerMetLeuVa » 



CTGCAGccaagc 1 1 ATCGATGATAAGCTGTCAAACATGA 

4981 * * * 5019 

ueuG 1 nProSerteuSarMtt 1 1 eSe**CysGl nThrEnd 



4800 



GGCGGTGATTCAACTACGCCAAGACCGAGATACAATGCGACCACCTATCTACCTCCCAGC 

4801 * * * * 4 Q60 

Gl yGl yAspS«rThrThrProArgProArgTyrAsnAl aThrThrTy ruauProProSer 



4920 



4980 
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Figure 22 
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Figure 2 3 
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Figure 24 
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Figure 25 : 

Expression of gp350-f ragments as B-cral fusion proteins 
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Figure 26: 

Expression of proteins 
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Figure 28 
A: p47 



GTACGXXQGCTTCTGCTQCTQCTTQXQAICArGGAAACCACTCAQACTCTCCQCi^HAb 

. _ ___——————+—— — — — — — — — — — — — — , ^ _ _ o 

'*<36 a -v— — — ' — — — — - — — 

C AXGCAACCG AAG ACGACGACGAACACXAGXACCXXXGGXGAGXCXG AGAGGCG AAATXC 

MSTTQTLRHK 

ACCAAGGCCCTAGCCGTCCTGTCCAAGTGCtAIGACCAIGCCCAGACTCAtCTCAAGGGA 

'c*cv~a — — + — - „__ 

TGGTXCCGGGAICGGCAGGACAGGXXCACGAXACXGGXACGGGXCXGAGXAGAGXXtXui 

T K A L A V. L S K C Tf D H A Q T H L K G 
GGAGXGCXGCAGGXAAACCXXCXGXCXGXAAACXAXGGAGGCCCCCGGCXGG^ 

" Q O '3 9 — ^ — — — — — — — + — — — — — — — ~ ♦ ™ - — — — — — " — — + — — — — — — — — — + — 

CC-CACGACGICCAIXXGGAAGACAGACAXTXGAXACCXCCGGGGGCCGACCGGCGGCAC 
GyLQyMLLSVNYGGPRLAAV 

gccaacgcaggcacggccgggciaaxcagcxxcgaggxcxccccxgacgcx^ 3oi(j3 

80049 caGXTGCGTCCGXGCCGGCCC^ 



I 

LlSEEWSPOAy 



1-1 — 



30103 



A N A G X A G 

IGGCAGAATCACCAGAGCCCAGAGGAGGCCCCGGCCGCC 

q + + + "** ^ 

' ACCGXCxTAGIGG-CTCGGGTCTCCTCCGGGGCCGGCGGCACAGTAAA-CTTTGGAAtGG 



aYSERNLA 



UQNHQSPEBAPrh 
" I ACGGGCGCACCTGTGICCTGGGCAAGGAGCTGTTTGGCTCGGCTGTGGAGCA^ 

80169 AlGCCCaCGIGQACACAGGACCC^ 

t)i aTCWtG-KELBG3AW-BQA3 



CTaCAAITtTACAAGCGGCCACAAGGGGGXTCCCGGCCTGAAITIGXTAAGCTCACTAlG 

gIcgttaaaatgxicgccgg-GT. 



30-33 



3022? 

GACGITAAAATGITCGCCGGIGITCCCCCAAiiiiij-wbUHu* i««»wn- 4 

2CXAXAXGCCC 

30348 

3 023? 

CTXAXACXACXAXXCCACAGGXXCTC(jfiXb«3i uiijiaM^uug 

E Y D 0 . K V 3. . K 3 H H X C A U ? Y rt P 




C 

3034? 

GGCCGG 




30408 



?A3DR-LftN 



Q rt t G Q l > - - h ? K 



acoGctxccxcgxt gc asaagtgggc acgccaisca 30j|M 

3040 ' TGCCGAAGGAGCAACGXCX^ 
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pi 



- 2 - 



I046' ; 




L * 




1 1 



Y K P L 3 V a ? f E * 5 1 " ' v A * A ° 




H y g C 3 V A A D 3 L A A 
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L S L 




IIA yyA'3LLX3 



A 

it 



AGIGTTATTTTATTTAACCACGCCICCGAAGAGG^ 



TCACi 

cyiLFMHASEE 



303 




EOKSPS^ 0 




p p ? P ? st x 



U H 3 l* 
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31069 +— - — «■ + *"~r:r: 31123 

TTCGTGGGGTTCTTTCACTTCGTCCGGAAAITGGGGGAGTAAAIIGIGGTACAAtAiaLAL 
KHPKK.VKQASNPLIA 

CAAGCAGCACCT 

311-'? - + * SI 140 

GTTCGICGTGGA 
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AAGXGGXXCAGXGGACACCCACCACACAGCAIGGCAaCGACCAGXCAIGXCGAGCaXGAG 

, rtrtU * JJ + + 7<S-*36 

io o / / +" *" ^ 

XXC ACCAAGXCACCXGXGGGXGGXGXGXCGXACCGXXGCXGGXCAGX ACAGCXCGXACXC 

MAXX3HVSH3 

CXCCXCXCCAAAXXGAXXGAXG AGXXA AAGGXCA AGGCCAACXC AGACCCCG AGGCTG AX 

j « — ^ +> +■ 7 f j4'V ( i 



ijtj/ — * 



G AGG AGAGGXXXAACX AACXACXCAAIXXCCAGXXCCGGXXG AGXCXGGGGCXCC3ACXA 
LL3KLI0SLKVKAMSDPEAD 
GXCCXGGCCGGGCGCCXGCXCCACCGCCXXAAGGCCGA6XCAGXXACACACACAGXAGCC 



'f* « oi — — — ^ — <— — — — — .— — — ^ — — — — — — — — — ^ — — — — — — — — — — — — — — — ^ 

CAGGACCGGCCCGCGGACGAGGXGGCGGAAXXCCGGCXCAGXCAAXGXGXGXGXCAXCGG 



y L A G S L L H R L K A S 3 V T H X V A 

G AAX AICXGGAGGXCXXCXCXGAC AAttXXCXACG AXGAGGAAXXCXXCC AGA XGCACCGG 

7 £ZZ7- + + * * * * 7GGib 

CXXAXAGACCXCCAG AAGAGACXKXX AAGAXGCXACXCCXI AAGA AGGXCXACGXGGCC 



c 



Y L E 



VESDKSXDEEEEQHHR 



GAXGAGCXGGAGACCCGAGXCXCXGCXXXCGCGCAGAGCCCGGCCXACGAGCGCAXCGXC _ 

7G617 — + - * * * * 

CXACXCGACCXCXGGGCXCAGAGACGAAAGCGCGXCICGGGCCGGAXGCXCGCGXAGCAG 



DELEXRV3 AEAQ3PAY E R I V 

XCCAGCGGCXACCXGXCGGCCCXGCGCXACXAXGACACCXAXCXGXAXGXGGGGCGCAGC 

-,-,--7 + _ + + + + * ' J * 

^GXCGCCGAXGGACAGCCGGGACGCGAXGAIACXGIGGAIAGACAIACACCCCGCGXCG 



A< 



3 3 G Y L 3 A L ft Y Y 0 X Y L Y V 'a K 
GGGAAGCAGGAG AGXGXGCAGCACXXXXAC AXGCGGXX AGCGGGCXXCXGXGCCXCAACC ^ 

, — — — — — *————————— -r — — — — — — — — — i — — — — — — 

^ ^* ^ <B» *W >^ <M ^» ^ *™ — * ^™ *™ ^ ^™ ^ ^* ^ ^* ** 

CCCXXCGXCCXCXCACACGXCGXG AAAAXGXACGCC AAICGGCCG A AGAC ACGGAGXXGG 
,3 K Q E 3 V Q H E Y n R- L A G E C A 3 X 
ACCTGCCXCXACGCGGGXCXCAGGGCAGCCCXGC AGCGGGCCAGGCCGGaGaX XGAGAGX _ ^ 

_^_ — — — — — — — ~>^ — » — — — — — -» — ^ — — — — — — / l^Orjt^ 

7,3707 — + 

TGGACGGAGAXGCGCCCAGAGXCCCGXCGGGACGXCGCCCGGXCCGGCCXCTAACICICA 

xclyaglraalqrarpeiss 

GACAXGGAGGXGXXXGAXXACXACTXXGAGCACCXAACCXCCCAGACGGXGXGCXGCXCC 

. _—————+———— — « — — — ^ — — — — ~ / < J • J 

** G S 5 7 — — • ~ ^— — — — — — — — — ^— — — — — — — — — * 

CXGXACCXCCACAAACXAAXGAXGAAACXCGXGGAXXGGAGGGXCXGCCACACGACGAGG 

0 » E V E D Y t E 6 H L X 3 Q X V C C 3 

ACGCCCXXXAXGCGCXXXGCCGGGGXGGAAAACXCCACXCXGGCCAGCXGCAXCCXCACC ^ 

— — — — — — ^ — — — — — — — — — — — — — — • o * / 

( .- oj^** — — — + — — — — — — — — — *» — — — — — — — — — — — — — — — — — ^ — — — — 

TGCaeGAAATACGCGAAACGGCCCCACCTTITGAaGIGAGACCGGTCGACGXAGGAGtGG 
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ACCCCCGACCXCAGCXCCGAGXGGGACGXGACCCAGGCCCXCX AIAGGC ACCXGGGGCGC 

„ _ _ _ „ + ---+ + — «i 77026 

TGGGGGCXGGAGXCGAGGCXCACCCXGCACXGGGXCCGGGAGAIAXCCGXGGACCCCGCG 

xpoLS-SEUOVTQALtRHLGS 

TAfCTCXXXCAGCGAGCCGGGGXGGGXGXAGGGGXGACGGGGGCXGGCCAGGAXGGGAnA 

77037 „I V * * * + v 770*6 

AXGGAGAAAGXCGCXCGGCCCCACCCACAlCCCCACiGCCCCCGAeCGGXCCTACCCTTX 

0 -f l ? Q a a g y g y g v i g a g q o g k 

r AC AIf*AGCCXCCXGAXGAGGAXGAXC AACAGCC ACGXGG AGX ACC ACA ACXAXGGCXSC 

7 70^7 * + * r " 77U ' 3 

GXGX AGXCGG AQGACX ACXCCXACXAGXXGXCGGXGCACCXCAXGGXGXXG AX ACCGftCG 

c HlSLLMftrtlMSHVEYHNYGC 

AAG AGGCCGGXC AGXGXGGCGGCCXACAXGGAGCCCXGGCACAGCCAG AIXXXC AAGXXX ^ ^ 

■ ____ — _ — — ■«. — — ■— — — — — — — + — — — — — ~ — ~ + ~ ~ — — / / H I '3 

"? *-• ^ iT« — — — 4, — — — — — — + — — — ~ ~ T 

TXCXCCGGCCAGXCACACCGCCGGAXGXACCTCGGGACCGTGXCGGXCXAAAAGXTCAAA 
c KRPVSVAAfMEPWHSQIEKS 

r- nr: ^ a AC G A AGCXGCCGG AG A ACCACG AG AGGXGCCCGGGCAXCXXXACGGGGCXC XXX 

+. •*■ + //-/6 

1 r^CTXXGCXXCGACGGCCXCXXGGXGCXCXCCACGGGCCCGXAGAAAXGCCCCGAGAAA 

c LEiKLPEiSHSaCPGIfTGLE 

GXCCCCGAGCXCXXCXXCAAGCXXXXXAGGGACACGCCCXGGXCGGACXGGXACCTGXXX _ 

' "T ~ *7 "? — — — ^ — — — — — — — — — * — — — — ——— — — ^ — — — — — — — -. — •* T 

' caGGGGCXCGAGaVgaaGXXCGAAAAATCCCXGXGCGGGACCAGCCTGACCATGGACAAA 
c ypELEEKLESOTPUSDUYLE 
GACCCCAAGGACGCCGGGGACCTGGAGAGGCTCTACGGGGAGGAGTTTGAGCGCGA^ 
' " 0/ C7G3GGXXCCXGCGGCCCCXGGACCXCXCCGAGAXGCCCCXCCXCAAACTCSCGCXCAIG 
c o P K 0 ' A 'G -0 L E 8 'u '( G S E 5 2 • 5 S t 

TAICGGCXGGXGACAGCGGGCAAGTTTIGXGGGCGGGXCXCCAICAAGXCCCXG ATGT7C ^ 

— — — —— — — — ^ — — — — — — — » — — — — — — / / "liju 

" ' AIAGCCGACCACXGICGCCCGXICAAAACACCCGCCCAGAGGXAGXTCAGGGACTACAAG 

c * R L V T A G K t C G ft V 3 I * 3 L ri 5 

tCXAICGXCAACXGCGCCGXCAAGGCCGGCAGCCCCTXCAXCCXXXXGA*^ 

77457 AGATAGCAGXTGACGCGGCAG^ 
c 3 i'JNCAVKAG'3?? iulksac 

AACGCCCACTTTTGGCGCGACCTGCAGGGCGAGGCCATGAACGCCGCCAACCTGTGCGCC 

773 TTGCGGGTGAAAACCGCGCTGGACGTCCCGCXCCGGTACTT 
c NrtHEU. SOI-QGEAriMAAMLCA 

GAGGXGCXGCAGCCCXCGAGGAAGTCTGTGGCCACCTGCArtTCTGGCCAACATCTGCCTC 
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/ / yj / / ———— — — — — — — — ^ — — —— — —— — — — — " — —— —— — — - / . IJwO 

CTCCACGACGTCGGGAGCTCCTTCAGACACCGGXGGACGXXAGACCGGITGTAGACGGAG 

E V L Q P 3 ft K 3 V A X C H L A rl I C L 

CCGCGCTGCCTGGTG A ATGCGCCTCIGGCGGTGCGGGC AC AGCGGGCCG AC AC GCAGGGG 
/ / Ijw/ — — — *«* - .• /oib 

P a C L V M A P L A y R A Q S A 0 X Ct 6 

77697 * * * + 77756 

C XACIXGAGGACGACCGGGAGGGAGCXGA6AGXCAGXGGGAXGGACCXCXCCCCCGXC AG 

D E L L L A L P ft I 3 V T L P 6 S G A V 

GGXGAXGGAXXCXCGCXAGCCCGCCXCA6AGAXGCCACCCAGXGXGCC ACC7TXGXGGXG 

77737 * + + * » 77316 

CCACTACCTAAGAGCGAICGGGCGGaG XCXCTACGGXGGGXCACACGGXGGhAACACCAC 

G 0 G t "3 L A ft L R 0 A T Q C A T E 'J V 

GCCTGCXCCAXXCTTCAGGGAXCCCCCACXXAXGAXXCCaGGGAXAXGGCCXCCAXGGGC 

77317 + * + + - 77376 

CGGACGAGG XAAGAAGTCCCTAGGGGGiGAAXACTA AGGICCCTAT ACCGGAGGTACCCG 

a c 3 r l q g 3 p r r c» s ?. o ;-; a 3 n g 

CTCGGGGTGCAijGGCCTGGCCGATGTCTTTGCGGACCTGGGCTGGCi-tGTACACTGACCCT 

/•/■>// ~ — — -r — — — — — — — — — -r — — — — — — — — — — — — — —— — — — — ——— — — — — — — — — — — — — — — — — — — -r — — — — — — , . 

/ GAGCCCCACGTCCCGGACCGGCXAChGAAACGCCXGGACCCGACCGXCATGXGACXGGGa 
- • L G " V a G L A D y s A 0 f u G U Q Y X D ? 

CCCXCXCGCXCGXTAAACAAGGAAAXAXXCGAACaX AXGXACTTTACGGCCCXC7GCACC 

77*937 + + * * * 

GGGAGAGCGAGCAAXXXGXXCCXXTATAA 6 CXTGIAXACaTGAAAIGCCGGGAG AC GIGG 

? :£ & 3 L H K E I c £ H fi \' E X A L C T 



AGXAGXCXGAXXGGACXrCACACCAGGAAGAXXXXXCCGGGXXXCAAACAGAGCAAGXAX 
+ * * ^ , 73036 

XCAXCAGACTAACCTGAAGXGXGGXCCXTCXAAArtAGGCCCAAAGXXXGXCTCGXXCArA 

3 3 L I ' G L H X R K I E P G E K a 3 K f 

GCCGGGGGGXGGXTTC ACXGGCACG A XXGGGCAGGA aCAGACCXXICXAXXCCCAGGGAA 

73057 + + + * * + 73116 

CGGCCCCCCACCAAAGXGACCGXGCXAACCCGXCCXTGXCTGGAAAGATAAGGGXCCCTX 



GGUEHUH0UAGX0L 3 IP RE 

AXXXGGXCXCGCCXCXCTGAACGCAXXGXGAGGGAXGGGCXXXXCAATXCACAGXXXAXC 

73117 + 1- + + + 73176 

XAAaCCAGAGCGGAGAGACXTGCGXAACACXCCCXACCCGAAAAGTXAAGXGXCAAAXAG 

I y 3 ft L 3 E R I P. 3 G L E N 3 Q E I 

GCCCXGAXGCCCACCXCAGGCXiiTGCCC^GGXGMCGGGCXGXXCGGACGCCXXCTACCCC 



C G G G A C X A C G Go X n G A G X C C G A C ;^ C G G G T C C A 0 X G C C C G A C A G C C X G C G G A A G A X G G G G 



73236 
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QV TGC3DAEYP 



ITCTAXGCCAAXGCGXCCACCAAGGXCACCAACAAGG AGGA^ ? ^ 

73237 T^Ic^^ac^a^ 

E Y ft M A S T K W T M K E E A L R P N 3 
tCTTTTTGGCGXCAT67GCGtCTGGATGACAGGGAAGCITTGAAT« 
?32 ' 7 aGAAAAACCGCAGTACACG^ 

s E „ a h « a u o o » s a l m «■ y s « a 

GTCTCCIGCCTCCCGGAGGCtCTGCGGCAGCGCTACCTGCGTIICCW 

73357 cagIgmcggagggcckcgaga^ 

u s-'c- l p e a l » Q a v «• * E Q T * E 0 ; 

TACA ACCAG5AGGACCXGAXTC AGAXGXCCCGGGACAGGGC^ ?g47S 
78 ' 17 ATGITGGICCTCCIGGACTAAG^ 



0 L I Q N 3 8 D R A P E V 0 « 



CAAICXCACAGCCTGXXTTTGCQXGAGGAAGAXGCCGCGCGGGCCAGCAC^ ^ 



36 



Q 5 H 



L a E E 0 A A R A 3 T L A N 



CXACTGGXGCGCAGCTACGAGCTGGGCCTGAAGACXATCAXGX^ y ^ 

L L V R 8 t B L G U K I I H t t C .» I S 

AAGGCCGCCGAICXGGGGGXGAXGGAGXGXAAGGCCAGCGCGGCXCXGTCGGXGCCG^ ^ 

73S?7 ;«MGMGC«G«C«MCM«K« . 
K A A 0 L G U i*. E C K A 3 A A L 3 V 

gaggaacagaaigagcggagxcccgcxgagcagaigccgccxcgtcccaxgg^ 7371 . 

73657 CTCcixGTCXXACXCGCCXCAGGG^ 



E Q 



3?ASQH p?P - ?rtEP 



caggxxgcggggccggxxgacaxcaxgagcaagggcccaggggagggaccaggxg^ 7g ._. 

73717 awSAMCKcScCA^ 



(10 t M S K G P « «>• " r " " 



tg-gtgcccgggggaitggaagxgxgcxaxaagxaccgica_g^^ 7g336 
73777 acaSmggmmS*^ 

C v P G G L 8 « C * K t a Q L _B 3 8 0 0^ 
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+ + + 783*6 

73337 QACAACCTCTG^ 

L LSID»3ETSRACSSCQ« 

gccaigiccaagtigitg 

73397 + - 735*1-- 

CGGTACAGGITCAACAAC 
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aQ , 0 A aa CAAACAtACAGQA M A^™ 1 

1320 ;;~«c«;5w;^ 



373 



CCTCTCCG 
13 E A M 




y H R S 



R P s b a ■ o r Q 



>. ii r, ft Q B '> B c v n - - 

...... .- GCCC TTC«iCA^XACGAtTCCi3QCCAaa*-i ^ 

^0 ► ::::::;;"^A«CTSIT6C6liGAAaCQTTCAiaCTAAa6».CQU 4 CC«L 
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QCGQCGSGCTTCCASAGrjQCCCTTCTGGTQQCCQQQCCAQAQACCQCTQACACiSAQQCCG ^ 

^ — _ — . — — — » — — ^ — — — — — — — — —•$• — — ~~ — — — — — — — — — — — — — — l+ >J *J ^ 

500 cSCci3CCCQAAQGICTCCCaGGAAGACCACCQ6CCCQ3TCTCTSQCGACTGTQCtCC63C 
-AQEQRAL.LVAGPETADtaP 
GACCXCCGCAAaCXGAAX3AGX33GX3XXX33X33CA33GCX8^ 

360 «GQAGGCGTTCaACTIACTCACCCACAA 



OUSKLNEUVf 3 G 



p. A A G G a Q L 



n80 + 



GCCGAC3AGCTAAAGAICGTGXCCGCGC"GCGAGACACTtACXCGGGCCAC^ ^ 

420 cggctgcxcgItttctagcacam 

ft D E U K I V 3 A L .* D I Y 3 6 H L W L 
CAGCCCACGGAGACCCITGACACArGGAAGGtGTTQAGCAGGGACACACQM 

«CaGGIGCCTCTGQGAACTGTGM 

qptbtlotukw'lssotrtah 
AasiiaaAGCACGGAiscAtTCATGCcacGaaaACCAiccAGaccAACTGC^ 25gg 
40 xcIaacckgmckaagmagmmgm 

SUBHGEIHAAatl.GAN-CPOL 
ITIAXGAGACGCCTGCACCCCGGCCTCITTCCCTTCGITAATGCAATAGM ^ 

*°° a;«acktgcggtcgmgggccggm 

e « a a a h p a l e ? e « n a i a 3 s 'u 

GGCXGGXAC-ACCAGACCGCCACCGGCCCCGGAGCAGAXGCCAGGM ^ 
:& ' 3<> CC3ACCAX3AXa3XCX3GCG^G3C^ 



-J 



Q U Y t Q X 



GPGADARAAARP. 



caacaggcctttcagaccagggcggcggctgaatgccatgccaaaagcgg^ 27 

;;;g«cggIIa«cm«ck^ 



» 2 C H A K 3 a V ? '» 



UQAEQXRA 

GTGGCCGGCtTCTACAGGACCAICAACGCCACGCTCAAGGGAGGAGM ^ 

mccggccgIag««cc«gmgttm 

y A 3 E Y R X I M A T L K a G. E 3 L * P 
ACXAXGIXXAACGGGGAGCXGG3GGCCATCAA3CACCAG^ ^ 

x n f m a e t. a a 1 k h a A l 0 X v » Y 

MCTACSaCCACtAtCTMtAATGtTGGaacCATTCCAGCCATGGAaCGGACr.SACGGCC 



50/64 



0173254 



- 11 - 

;'?00 ^ * **" * 

CXGAXGCCGGXGAXAGAGXAXXACAACCCCGGXAAGGXCGGXACCXCGCCXGACXGCCGG 



D Y G 



HYLIMLGPEQP«SGLXA 



CCXCCG XGCCCCXAC'GCCGAAAijIXCAXGGGCACAGGCGGCCGXGCAGACGGCCCXCGAG ^ ^ 



4. <r 301? 



- 560 GQaGGC ACGGGG AXGCGGCXXXCAAGXACCCGXGXCCGCCGGCACGXCTGCCGGGAGCXC 



P C P X A 5 3 b 



UAQAAVQXALE 



CTGI-CTCGGCCCtGTACCCGGCCCCGXGCAICtCGGGCTACGCGCGCCCCCCGGGCCCC 

„ ^ ^ J ^ ^ _ H + . + — 7 

^ GACAAGAGCCGGGACAXGGGCCGGGGCACGXAGAGCCCGAXGCGCGCGGGGGGCCCGGGG 

L c 3 A L X P A P C I S G Y A R ? ? G P 

AGXGCIGXGAXCGAGCAXCTGGGG XCCCXAGXXCCAAAGGGGGGXCTGCXGXTGXXXCXG 

3080 * * ♦ * .~ + * 

TCACGACACXAGCXCGXAGACCCCAGGGAXCAAGGXXXCCCCCCAGACGACAACAAAGAC 



■3 ri f 



IEHLG3LVPKGGLLLEL 



ICXCACCXACCGGAXGAXGXXAAGGACGGGCXCGGAGAAAXGGGGCCGGCCAGGGCCACG 

*» i » > — — — — — » — + — * — — — ~ ™~ + ~ ~ ™ """ ™ — — ^ »» — — — *■ • ~ L *V 

' agagtggaxggk^ 



n< 



•3 



H ' L ? 0 D V K OG LGEMG P A R AX 



ftfiACCTGGAAIGCAGCAGTTXGTCAGCAGCTACTTCCTCAACCCCGCCTGTTCCAALbTC ^ 

* + * 

rcXGGACCXT ACGXCGTC Art ACAGXCGTCGATGAAGGAGXXGGGGCGGAC AAG6XXGCAG 



p rj rt Q Q c V 3 3 Y E L N P A C S M V 



ITCAIXACAG X GAGGCAGCGAGGGGAGAAGAXCAACGGCCGXACCGXCCXCCAAGCGCXC 

^ . _ _ . ^ _ _ • — — — — — ^ — » — » ™ -™ » — + ™ — — ™ — — ^ » — — ^ » OO 1 y 

* lAatAATaTCACTCCQTCQCTCCCCICTtCTAQTtaCCQQCATSaCAaQAQQTTCQCOAQ 



c X X 'J a a a G 2 K I N g a X V I Q A I 

GGACGCGCAXGCGAXAXGGCAGGCXGCCAGCACXAXGXGCXGGGCXCCACGGXXCCCCXC 

j. + * ■* ; °' ' 

CCXGCGCGXACGCXAXACCGXCCGACGGXCGXGAXACaCGACCCG AGGXGCCAAGGGGAG 

13 & A C 0 rt A o C Q H Y V L G 3 X U P L 
GGXGGACXCAACXXXGXCAACGACCXGGCGXCCCCGGXXXCCACCGCCGAGAXGAXGGAX 

+ 4--- + ■JnO, 

' CCACCIQAaXTQAAACAGITQCtRQACCQCAGQaGCCAAAaQTOQCQQCICXACtACCTA 



r, a L N E 'J N 0 L A 3 P « 3 T A E rt rt D 
aAXXXCXCXCCCXXCXXCACC8X88A8XXXCCCCCSATXCAAGAGGA^ 



i440 + * * * * 



cxAAAGAaAaGGAAGAAGxaacAccxcAAAeaaaacxAAaixcxccicccacaixcAAGA 



DE3?E?XVEE'r-PIQ EE,3,:i 



5 3 



CMGXACCCXIAiMTSTaGAeaAGAGCAX38ACAlCXCTCC^ 

3300 aaocAxaaanxcxIcAccxacxc^ 
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P V ? L D 'J D E 3 H G I 3 _ ? 3 '( Z I P J 

CICTCGCIGGAilTCATGCC-rCACAAijCAXCCrGTCACACCCCACCGIGGijAMilCAAGG^G 

560 ^ 1 * * 36 i"? 

GAGAGCSi^CXCAG-XACGGAGXGXXCGXAGGACAGXGXGGGGXGGCACCCXXCGXXCCTC 



U .3 U - O L. 



X 3 I I 3 H P T y G 3 K 5 



CACTTGGICAGGCACACGGACAGGGTCAGCGGAGGACGCGTGGCACAGCAGCCCGGGGXA 

. — — — 3 b / "V 

' "? (J ^ — — — — — — — — — — — * ™ 

GIGAACCAGTCCGTGTGCCTGICCCAGXCGCCTCCrGCGCACCGTGICGTCGGGCCCCAl 

H L y a H I D ft V 3 G 5 R V A Q Q ? G M 

GGTCCCCTGGACCTGCCGCTGGCGGACTACGCCTTCGTTGCCCACAGTCAGGTCtGGACC 

3£3q «. + t* «■ ■*■ " 

CCAGGGGACCXGGACGGCGACCGCCXGAXGCGGAAGCAACGGGXGXCAGXCCAGACCXGG 

QPLD'LPLAOYAHV.AHSQyWT 

AGGCCCGGXGGGGCXCCXCCCXXGCCCXAXCGXACCXGGGAXCGAAXGACAGAGAAGCXG _ 

3740 * + -r + + * 

XCCGG GCCaCCCCGmGGaGGGAACSG G AXAGCAXGG ACCCXAGCXX ACXGTCXCXXCGaC 

R p G G A P ? L P Y ft T U 0 R M I ' £ K i. 

CXXGXCXCC3CAAA ACCCGGCGGAGAGAACGXXAAGGXXXC AGGXACCGXGAXX ACAXXG 

3 *3 1) >__———— — — — — — — — — — — — — + — ~~ — — — — — — t 

GAACAGAGGCGXXXXGGGCCGCCXCXCXXGCAAXXCCAAAGXCCAXGGCACXAAXGXAAC 

L g 3 A K P G G E N V K - V* 3 G "X V I 
GGAG AACAGGGGXACAAAGXGXCGXXGGAXCXGAGGGAGGGAACCAGGCXGGCAaXGGCT 

* * + + + * 

CCXCXXGXCCCCAXGXXXCACAGCAACCXAGACXCCCXCCCXXGGXCCGACCGXTACCoA 
QEQGYKWSLOLR.E G X R L A n A 



33G0 



GAGGCGCXGCXGAACGCAGCAXGXGCCCCAAXCXXGGAXCCGGAAGACGXCXXGCi-ALw 

. — _ — — ^ — ^ — — — — — — — — — — — — — — — — J V 

^ ^ ^ ^ — ^ ^ ^ ^ •* ™ ^ ^* 4* — * ^» * » ^ ™ ^ * ^ T 

CXCCGCGACGACXXGCGXCGXACACGGQGXXAGAACCX AGGCCXXCXGCAGAACGAGXGG 



SALLNAACA 



& I L 0 P 3 D y L u X 




tHLDPRRADMSAVMEArtT 




A A 



D T A R G L G y K L X c 'i o 



° A 3 C 




P 3 X >i 3 3 A 3 f n r V y A 3 y 3 A 

GGGGAAXXCXCGGGXCCXCXGAXCACOCCAGXGCXTCAGAAGAOGGGCAGXCXCCXGAXX 
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421-5 



4loO + + " 

CCCCXXAAGAGCCCAGGAGACTAGXGCGGXCACGAAGXCXXCXGCCCGXCAGAGGACXAA 



ESGPLITPV LQKTGSL-LI 



GCGGXGCGXXGCGGGGAXGGCAAGAICCAGGGAGGGXCGCXGXXXGAGCAGCXCXXXAGC 

42C0 + + * * * * " 

CGCCACGCAACGCCCCTACCGTTCTAGGICCCTCCCAGCGACAAACTCGTCGAGAAATCb 



A V R C G 



0GKIQGG3LEEGLE5 



GACGXGGCCACGACCCCACGGGCACCCGAGGCGXXGXCXCXGAAGAAXCXCXXCCGGGCA 

4230 + * * * * 4333 

CX6CACCGGXGCXGG6GIGCCCGXGGGCTCCGCAACAGAGACXXCX7AGAGAAGGCCCGX 



OVA 



ITPRAPEAL3LKNLSRA 



GXCCAGCAGCXGGXCAAGAGCGGCAXCGXGCXGXCAGGGCAXGACAICAGCGACGGGGGC ^ 

4340 ^ + - + * * 4o * 9 

CAGGXCGXCG ACC AGXXCXCGCCGXAGCACGACAGXCCCGX ACXGX AGXCGCXGCCCCCG 

VQQLVK3G I V L 3 G H D ISDGG 
CXGGXGACCXGCCXGGXGGAGAXGGCCCXGGCCGGGCAGCGGGGAGXGACCAXCACXAXG 



L * -.--..w 44S<n 

^400 * * + * „ 

GACCACTGGACGGACCACCXCTACCGGGACCGGCCCGXCGCCCCXCACXGGXAGXGATAC 

LV'XCLWEHAL-AGQRGVX I X M 



rCGGXGGCCXCCGACXACCXCCCGGAGAIGIXXGCAGAGCACCCCGGCCXGGXGXXXGAG 

4A60 «. * * * * - — + 

GGCCACCGGAGGCXGAXGGAGGGCCXCXACA AACGXCXC6XGGGGCCGGACCAC AAACXC 

? V - A' S "D "*Y L P E M E A E H P G L V E E 

GXGGAGGAGCGCAGCGXGGGXGAGGXGCXGCAGACCCXGCGCXCCAXGAACAIGXACCCG 

45^o - + + * v * 4j/? 

CACCXCCXCGCGXCGCACCCACXCCACGACGXCiGGGACGCGAGGXACXXpXACAXGGGC 

* 

ygSKSyGEVLQXLaSHNMYP 
GCAGXCCXCGGXCGAGXGGGCSAGCAAGGXCCAiSAXCAAHXGXXXGAGGXGCAGCACGG^ 
4j30 CGXCA6GAGCCAGCXCACCCGCXCGXXCCAGGXCXAGXXXACAAACTCCACGXCGXGCCG 
AULGRVGEQGPOGME. EVQHG 
CCAGAGACGGXGXXGCGCCAGXCGCXGCGCCXGCXGCXGGGAACCXGGXCAXCCXXXGCC 
GGXCXCXGCCACAACGCGGXCAGCGACGCGGACGACGACCCXXGGACCAGXAGGAAACGG 

PSXVL&QSL. RLLLGXU3SEA 

AGCGAGCAGXACGAGXGCCTGCGACCAGAXCGGATXAACCGG XCCAXGCACGXGXCCGAC ^ 
- —————— — ———— — •* / J— 

4/00 XCGCtCGXCAXGCICACGGACGCXGGTCTAGCCT 



4<S?3 



ssqyeclr'porlnrs-uhvs 



XACGGCXAXnACGA^ACXGGCAiiXCXCCC^ 

Imccgaxaxxg^ 



53/64 



0173254. 



- 14 - 



Y N E A L A V 3 P L X G K ft L 



CGGXXGGXGACAGAGCCXGACCCACGAXGTCAGGXGGCCGXGCXAXGC3CCCCGGGCACC 

4320 +■ + + + * 4379 

GCCAACCACTGICTCGGACTGGGTGCTACAGXCCACCGGCACGAIACGCGGGGCCCGTGG 

R L V X S ? D P R C Q V A V L C A ? G T 



AGGGGCCATGAAAGCCTCCTGGCGGCCTTCACGAwTGCCGGATGCCTGTGCCGhCGGGTG 

4330 * + + * 

XCCCCGGXACXXTCGGAGGACCGCCGGAAGXGCXXACGGCCXACGGACACGGCTGCCCAC 

RGHESLLAASTNAGCLCRRV 

XXCTXXCGCGAGGXXAGGGACAACACGXXCCXCGACAAGXACGXGGGXCXGGCCAXCGGA 

4940 + +■ * + 

AAGAAAGCGCXCCAAXCCCXGXXGXGCAAGGAGCTGXXCATGCACCCAGACCGGXAGCCX 

EEREVRDNTcLDKYVGLA I G 

GGAGXXCAXGGGGCCAGGGACTCXGCCCTGGCAGGCCGXGCCACCGXGGCGCXGAXXAAT 

5000 + + * * ♦ + 

CCXCAAGXACCCCGGTCCCXGAGACGGGACCGrCCGGCACGGXGGCACCGCGACTAAXTA 

G V H G A 3 H.-* 3 A L A G R A X V A L IN 

CGXTXCCCCGCCCXGCGXGACGCXAXXCXAAAGTICCTCAACAGGCCAGAXACGXXCXCG 

5060 + — + ^ * + 

GCAA AGGGGCGGGACGCACXGCGAXAAGA-XX XCAAGG AGXXGXCCGGXCX AXGC AAGAGC 



P A L R 0 A ILKFLNRPD 



GXGGCCTXGGGGGAGCXGGGGGXGCAAGXXXXGGCXGGCCTGGGGGCCGXGGGGXCAACA 

5120 + + + + * 

CACCGGAACCCCCXCGACCCCCACGXXCAAAACCGACCGGACCCCCGGCACCCCaGXXGT 

•J A L G H L. G 'J Q V L A G L G A V G 3 X 

GAXAaTCCACCCGCCCCXGGCGXGGAAGXTAaXGXCCAGAGAXCACCXCTGAXXCXGGCC 

5130 + ♦ ^ * * 

CXAXTaGGXGGGCGGGGACCGCACCXXCAAXXACAGGTCXCXAGXGGaGACX ArtGACCGG 

0 N P ? A P G y 2 V H V Q R 3 ? u I L A 

CCCAACGCCrCXGGCAXGXTXGAGXCCCGCTGGCXGAACAXXAGCAXCCCGGCGACCACC 

5240 * + * * * 

GGGTXGCGGAGACCGTACAAACTCAGGGCGACCGACXIGXAAXCGXAGGGCCGCXGGXGG 

PNASOiiESSRULN 13 IPAXX 

AGCTCTGTCAXGCXGCGXGGCCTCCGGGGCXGCGTCCTGCCXXGXTGGGXGCAAGGCXCG 

5300 * * * + * 

XCGAGACAGXACGACGCACCGGAGGCCCCGACGCAGGACGGAACAACCCACGXXCCGAGC 

3 S V h L R. G R G C V L P C U V Q G 3 

XGCCXGGGCCTGCAAXTTACTAACCXCGGGAXGCCAXATGXXXXGCAGAAXGCCCACCAG 

5360 + + + ♦ + 

ACGGACCCGGACGXXAAATGAXXGGAGCCCXACGGXATACAAAACGXCTXACGGGXGGXC 

C L .i U Q t* T N L G n P ? L 0 N A H a 

AXCGCCXGCCACXXCCACAGCAwrGGCACGGAXGCCXGGCGCXXXGCXAXGAAXXAXCCA 



4999 



1059 



179 



~ *• *■» o 

-J w -w * 



* - CI CI 

•J H ■ - 



535? 



5419 
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. + 547 



TAGCGGACM^ 

IAC HeHSNGTDAURFAMNYP 
A (3AAACCCCACQQASCAGQQCAACATTQCAQQQCXCTGTTCACQCQATQGTC3TCATCXQ 

— — — — — — - t — — — — — — — + — — — — — — — — — -r- — — — — - ~ - 

430 ICXTIGGGGX^ 



SNPTEQGN I 



G L C 5 R D G P. H L 




ULC 0PSLCT0BUQUBHIPP 



>soo 




A S 



QHPTQCSPUTLttEQAAHl- 



TGGTCACTCAGGCACGGICGCCCCTCCGAGTGACCAGTCAC 

„. _ __——————+——————— — — + — __ — — — — — — +■ „j/U f J 

IcCAGXGAGXCCGXGCCAGCGGGGAGGCXCACXGGXCAGXG 



USLRHGRP 



rs 



0173254 
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AICCAIC"TTATTGACAATTAICAAAAAACCAC«^ i335 g 

4 3300 iaggxagaaax^ 

k k 5 V k) K N G E K L I R & 
XACCGGCGCCACCXCTXCAAXX AXAXAGXGXCCGXAAXGGAXGGGGGCGXG^ ^ 

_ _: . . — —— — — — — — — + — — ~~ — ■"~"" p— ' — — ■ — • w - 

43360 t 



AXGGCCGCGGXGG AGAAGXXA AX AX AXC ACAGGC AXX ACCX ACCCCCGCACCCAGACAAA 
g p A V E H I 



£ Y H G Y H I P A H X Q K 




!1?£0 T L A R ^ 3 A S L A P 




•1 



C 



PQELE-voiQAivvea 




M3IAPHQQL8QSYENVM 




Y *- G ?. M H R L L 



K N s F A S 




D G L 



; i-, p u ft t a r f 




GCGAGAAIGGICAIAeAIGAGCCCCTCGGCAAC^ ^33. 

' 3730 MCTCtTACMQTATCWCTCSGQMM 

» 3 H D t I U « S A « E Q « Y « E C S « 

CACACAGGCSGCCCGICCCCITGGh.^ 4 . 33 ^ 

■4 2040 * ~ *~"I„-r* , r r ^ . . • Ar** J *TTATTrGG T GCAGACGGCA 

jXGXGXCCGCCGGGCAGGGG«ACCXCiCAMLCXaw«n«L*- t i«ITL j j. 



5b/b4 



0173254 



- 17 - 

g C A A R G R P 3 N 3 K a r t A V 0 A X 



GACCGGXGXTACGAIAAXCTCACAGGXGGCCIGCTGoCCoTGGCAGAGTCCTGGAijCTCC 

"~ "** ~ " ^ ~ ■™"^~ «■ — — — — -.-»—■*■> — — — — — — — — — ^ — — — — — "t - — a -J V 

CXGGCCACAhXGCXAXXAGhGXGXCCACCGGACGACCGGCACCGXCXCAGGACCXCGaGG 



V P T V 



I E C X A Q Q G H C L G P A 



AXXAACAXIAGXCAXACCXGCCAGGXAXGXCCXGGGGXCCCGAAGCAGCGXCCCAIXGCG 

. . — — — — — — — — — — . 

J TAATIGIAAICAGIAXG6ACS8XCCAIACA»3GACCCC«i3'3GCXTC8XC3CAGG''3XAriCSC 

N V N X » Q A L Y I S ? & * L L 1 0 fl S 




Q A G 



VKAKlYONV.aQ.M 



n 




•3 



p c y 3 E c D Q X H I G n 0 X X I A A -J 
CCXGGCCGGAGXCAXGGXCGAGCXAXAACXAAGCCCGGXG^ 

44140 GGACCGGCCXCAGXACCAGCX^ 

RAPXMt3STSLQXDI3_AMEK 




H V E E X V A 0 V & A £ - R 



V r* P X ' J 




n 3 t i « 



- ft 3 3 ' ?. -L I M 




a *J x rt a v g 



H : K L K A 0 A £ 




• " _1 *1 
^ -t t ^ - 



•1 n 



rt P 5. fi K I- • H H A X n G 



t ■ * ^ 




i * * vl 



CI* 



L .: e a * • t *. * ? * * H u 1 11 ' 4 ' 

.- J A.3CX.lCTCXr J r J CC i 3.3XTr > ^.30CXXCCA.:r J QCCC.3CX ! 3CX6*,3.iAlT.-,C ! iCArAAi:AA, 
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4455V 



*-*500 +- + ^ ^ 

CTCG ACG AGACCGGCC A ACXTCCG AAGGXGCCGGGCG ACGACXCCTAACGCGXAXXGXT . 



L Q E P R 



H e A E y A R Q Q ? N R y F 



445&0 



ijGXGGCAACAXCCXGGXGCAXGGXGGCAGCCACXCGCGGGXCCCCGXAAAACAXAXGGAA ^ 

+ „- ----- -j- — — - + -t-sb i - 

CCACCGTTGIAGGACCACG T ACCACCGTCGG XGAGCGCCCAGGGGCATT tTGXATACCTX 

T A y D Q H H X A A V R P D G Y E M H E 
AGG AAXGGCGXGAAAG AG ACACXGGGXG ACGGCCCGGGXCCXCXCGG AG AAGGCAAAGGC 

4 ^ ^» ^ ^» ^ ^ ^ ^* — ^ ^* ' — <W M ^ ^ ^ ^ ^™ ^ — * ^ ^ *™ — * " ^ * W / * 

4 * 4b20 TCCTIACCSCACT^ 

?1AH EUCQTVARTRE3EABA 



44630 



rACCAGCCCGXXCACCAAAACAGXCXGCXCXGXCCGCXXGXCGGCGGGAXXCGGGGCCAG 

I + + + + M ^-/or 

GXGGXCGGGCAAGXGGXXXXGXCAGACGAGACAGGCGAACAGCCGCCCXAAGCCCCGGXC 

y LQMVLVXQEXSKOA ? N ? A L 
CXGCXGCGXAACGXCAXXGXCCACCGACACACGCACGGCACGGGXGAAAGXGGGGCAGGX ^ 

* . — — — _ i — — — — — — ———>—— ———————+——— — — — — — — «* — / * 

* *74Q + «- — ■— — — — — — — + — — — — — — — — — + — — — — — — — — — -r ^ 

GACGACGCAXXGCAGXAACAGGXGGCXGXGXGCGXGCCGXGCCCACXXXCACCCCGXCCA 



Q Q X V D 



N 0 V S V R Y A " R X. E X PCX 



CAXGAAXGAGGCGCXGAGGXCCCXGAXCAXGCCCACGGXGGGGCGGAGGXCGGAGATCXC ^ 

^ — — — — — — — — — — — — — — — — — — «^ — — — — — — — -»»" + — — ~ — — — — — — n **. o >j -» 

44300 gxacxxacxccgcgaExcc^ 

H F 3 A 3 L D R I M G V X P R L D 3 I H 




ORLXG^E-LflDulOSORT 




hhseapgnyalxq 



A V H Q E 0 V 



CCCGAGGCCGCACAXGXGGGCAXXGGXGCAGGXXGGGAGGAAAACGXAGXAAAAGAXCXX ^ 

+ — — ——-—-——+ -! ■ J V w - 

•I 4^*30 -»- — — — — — — — T— — — — -r 

' * gggcTCCGGCGXGXACACCCGTAACCACGXCCAACCCTCCXXTTGCAXCaITXXCXAGAA 



QUiJCMHANX 



t ,-> x p l E V f * f 1 K 



TTCCAGCACAICCGCAIGCCCCTCAICTACaTAAGGGCCT^ 4jM , 

45040 IIggxcgxgxaggcgxIcgg^ 

ELVOAH. SEDWYPSUHLRSOH 
GXCGTGGXC aCCGXTAACCCGGX AGCCGIACArtGSCCACA AAXTG^ 

45100 MGCACSAGAGGC^^ 
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D H D G N l ) & f 3 f LAV r G A A M SO 

CAXGXXXCCAACCCXCXCAATAAaCXGGGGCGCGGCCAGGGXGXCAGCGXAAACCXCAXX 
45160 + * + + 45219 

GXACAAAGGXXGGGAGAGXXAXTXGACCCCGCGCCGGXCCCACAGXCGCAXXXGGAGXAA 

ri N G V ft Z I = a ? A A L X 0 A Y U E H 

XCCGAXAAXAAXCXGGGGGGCCCGGXCACXAACGGXGAGAAGAIGGGXGAAAAXGXCTGX 

45220 + * -r + 45279 

AGGCXAXXAXXAGACCCCCCGGGCCAGXGAXXGCCACXCXXCXACCCACXXXXACAGACA 

G I I I 0 ? ft R 0 3 V X L L H X E IDT 

•J 1 n *J o ^ C rt L w - J ' J • 3 »j 'J 13 A h u r» Ij • J 1 i. et 'j a »J X L L ri lj »J in »j n I J c u o n 'J ri L n x n U * >J n U U w n L >j L x 

45230 * -r + «- * + 45339 

CAXCCGGXGGCCCCCCXCGXCCAAXCCCAGGXCCXCXCGCGXCXGXAXGACXGGGXGCGA 

Y A 'J P P L L N P 0 L L . A C V T Q G V 3 

CXCAXCCCCCACAACATCXGACCCGGCCAGGCGCATCAGGGCCXGCXCXAGGGCXAIAAG 
45340 + * + * 45399 

E D G V V D 3 G A L R tt L A Q E L A I L 

XXCCCCAXAGAXXXXXCXAXACAXGGAAXAGGCCXCCXXGGAGAXGGCGXXAXXXCCCAG 

45400 * + -r +• * -f- 45459 

AAGGGGXAXCXAAAAAGAXAXGXACCXXAXCCGGAGGAACCXCXACCGCAAXAAAGGGXC 

E G \' IKSYM3YASK.3 I A M ft G L 

GXGGCGGCAG ATGAAGXXGAXC AXGG AAAAGCXGXXC ACAAAGGCAAGCCXCCCXGACCG 

45460 *• ■- — - + + + 45519 

CACCGCCGXCXACXXGAaCXAGXACCXXXXCGACAaGXGXXXCCGXXCGGAGGGACXGGC 

H 2 C I E K I in 3 S 3 N V E A i. R G 3 R 

XXCCCAGXAGGXGXTGAXGCACAGGGACACCAAAGGCaCGXXCAXGACAAACXXXXCCXC 

45520 * ^ * ^ 45579 

A A GG G XC A X C C AC A AC X AC G TG X CCC XG X G G XX X C C G X G C A A G X A C X G XX XG A A A A GG A G 

E - Y X V I C U 3 V L ? V i"i V E K E £ 

AAACCCGXGGATCAXAGCCXCGACXACGXAGAAGAAGGCXGGAXAGGCAGXGXC AXAGGC 

45530 *■ + - + + 43639 

TXXGGGCACCXAGXAXCGGAGCXGAXGCAXCXXCXXCCGACCXAXCCGXCACAGXAXCCG 

E G H I rt A E V y Y E c A P t A X D ( A 

AGXAXCCXGCACAGXCXCAAXAACGGCCXGAXCCACCACGXGGGCCAGAGAXGXGGCGGT 

45640 ^ + * * ^ 4-o?* 

XCAXAGGACGXGXCAGAGXXAXXGCCGGACXAGGXGGXGCACCCGGXCXCXACACCGCCA 

TDQV'TEI'WAQDVMHALSTAT 

CXCAAACXGCXGCCCCCGGGCCXCTXGGAAXGCAGCXGGGGCCAGGGGAGTCGGCAGGXT 

45700 ♦ + * + 45759 

GAGXXXGACGACGGGGGCCCGGAGAACCXXACGXCGACCCCGGXCCCCXCAGCCGXCCAA 

. 2 «j Q G "P. A E Q E A A P A I P X P L .1 

ACCCACCATXAGCCGGXGCACAGCCCTGXGCCXGGCCCXCXCCCCGGCAXCCCXGCCAAX 
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45780 + +* * 

XGGGXGGXAAXCGGCCACGXGXCGGGACACGGACCGGGaGAGGGGCCGTAGGGaCGGXXA 

G Y rt L R H y A R H R A R E G A 0 R G I 



-1 w O 1 V 



GXAAAXAXCAXAAAGGGGGXGCAGCXCCaGCCGCaGCaGGXCAXAaXXGGACGGGXGGAG 

45320 + + " 

CAXXXAXAGXAXXXCCCCCACGXCGAGGXCGGCGXCGXCCAGXAIXAaCCXGCCCACCXC 



Y XDYLPHLELRLLDYfiSPHL 

GAAGXCXXCGGXGGGCAGCCCGCACXXGAGAGCXAXAXCXGXCACGGGGGCXGCAXACXX 

45330 * + * * + 

CTXCAGAAGCCACCCGXCGGGCGXGAACXCXCGAXATaGACaGXGCCCCCGhCGXaXGAA 
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Figure 29; a-oal-:pl50 fusion proteins 
Expression of the o 'i ai " )ilJ " ' 
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Igg&ISIty of the B-gal:: P 150 fusion proteins 
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